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FOREWORD 



The Computer and Information Science Research Center at The Ohio 
State University Is a research organization which consists primarily 
of faculty, staff and gradxiate students of the Department of Computer 
and Information Science* Some of the research activities are performed 
In conjunction with other University departments as well as off-campus 
organizations. 

This publication contains the abstracts of research which has 
been carried on during the 1973-74 academic year. This research has 
been supported la part by grants from governmental agencies as well 
as by The Ohio State University. Sponsorship with government agencies 
and with other units on the campus is identified at the end of an 
abstract. 

A bibliography of the research reports published by the Center 
is included in this publication as Appendix F. Copies of some of 
these reports are still available on a complimentary basis from the 
Computer and Information Science Research Center, The Ohio State 
University, 2024 Neil Avenue, ColuiAus, Ohio, 43210. Titles with 
PB numbers may be obtained from The National Technical Information 
Center, The U.S. Department of Commerce, 5285 Port Royal Road, 
Springfield, Virginia, 22151, in paper copy, magnetic tape, or 
microfiche. There Is a nominal charge for their service. 

Marshall C. Yovits 

Director, Computer and 

Information Science Research Center 
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I. THE OHIO STATE UNIVERSITY COMPUTER AND INFORMATION SCIENCE RESEARCH CENTER 



ORGANIZATIONAL STRUCTURE 



The Computer and Inforaation Science Research Center at The 
Ohio State University is an interdisciplinary activity involving 
the faculty, staff and graduate students from the Department of 
Computer and Information Science. Some of the research activities 
are performed in conjunction with other University departments as 
well as off-canpus organizations. The Center alsj interacts closely 
with Battelle Memorial Institute, Chemical Abstracts Service, and 
the Ohio College Library Center, which are adjacent to the Ohio 
State Campus, as well as with a number of other organizations located 
in Columbus, Ohio which are engaged in computer and information science 
research activities, such as Bell Laboratories, Western Electric 
Corporation, and Rockwell International Corporation. Although the 
Research Center and the Department utilize many of the same personnel 
and have the same Director, they are two separate and distinct entities. 



OBJECTIVES OF THE CENTER 

The Computer and Information Science Research Center has the 
following objectives: (1) to develop a broad research program in 
computer and information science; (2) to develop, test, and evaluate 
practical applications of research in computer and information 
science; (') to coordinate and integrate these functions with an 
academic program in computer and information science at The Ohio 
State University, as well as with other disciplines at the University. 
The Center is a focal point for a number of applied information 
processing activities on the campus. 
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SCOPE OF THE PROGRAM 

The program In computer and Information science at The Ohio 
State University has been defined broadly to encompass most of the 
analytical activities frequently considered to be part of this dis- 
cipline. This approach has been chosen because It Is felt that In 
order to generate the needed cc .^apts, foundations, and generalised 
techniques, It Is necessary to examine analytically a number of 
different arecs of computer and Information science. In this way 
a firm empirical and theoretical foundation may ba established for 
generalized computer and Information systems. Such a view commits 
the program not only to the theoretical study of computer Aid Infor- 
mation system, but also to the study of their realization and their. 
Impact on the user. 

Those areas of study which ace emphasized both In the academic 
program and in the research activities of the Center are as follows: 

!• Generdl theory of information 

2. Information storage and retrieval 

3. Theory of automata and theory of computation 
A. Artificial intelligence 

5. Pattern recognition 

6. Computer programming, including systems programming 

7. Theory and processing of programming languages 

8. Digital computer architecture and organization 

9. Numerical analysis 

10. Man-machine interaction and systems 

11. Formal and computational lingv^stics 

12. Management information and systems 

13. Biological information processing 

14. Social, economic, and psychological aspects of Information 
production, processing, and use. 



FACILITIES 

The Computer and Information Science Research Center has a 
Digital Equipment Corporation DEC System-lO computer which was obtained 
with the assistance of a grant from the Office of Computing Activities 
of the National Science Foundation. This computer is a moderate size 
flexible time-sharing computer which is dedicated to research and 
education in the field of computer and information science* This 
computer provides valuable hands-on experience for the faculty and 
students of the Computer and Information Science Research Center and 
permits research activities involving non-standard and innovative 
applications of computers of both a hardware and software nature* 
Some of these research activities which are currently underway are: 

1) Experimental and developmental research in time-sharing and 
multiprogramming systems* 

2) Complex systems simulation research using graphical display 
devices. 
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3) On^^llne information retrieval systems studies. 

4) Hardware modification and Interface studies • 

5) Software modification and development (e.g., FL/1 and 
COBOL compilers). 

6) Man-machine Interaction and psychophysical experiments. 

7) Pattern recognition studies. 

8) Computer simulation of language learning* 

9) Speech analysis and synthesis. 

10) Analysis and synthesis of human locomotion. 

11) Computer data base systems. 

The Research Center also has access to the University Computer 
Centers. They are: Instruction and Research Computer Center; Hospital 
Computer Center; and University Systems Computer Center. 

Included In these centers are an IBM 370/165, an IBM 370/158, 
an IBM 1620 for plotting, and an IBM 7 for analog to digital conversion, 
as well as several IBM 1130 machines, and a number of remote terminals. 

Many specialized facilities and laboratories of the University 
are also available to the staff and students of the Research Center. 
Some of these are the Office of Computer Assisted Instruction, the 
Institute for Research In Vision, Telecommunications Center, Behavioral 
Science Laboratory, Listening Center, Communications and Control 
Systems Laboratory, Mershon Center for Education In National Security, 
and many others. 

# 

The Research Center also Interacts with the Ohio College Library 
Center which Is administratively Independent of the University. Tlie 
Center was formed by the Ohio College Association, and operates a 
common computerized library network connecting the Ohio colleges 
and universities (both private and state assisted) and many points 
outside the state of Ohio. These Include eight regions with 100 
colleges and universities, and 19 public and special llbx^arles 
participating In the system. Two well established national Information 
systems have units on the campus of The Ohio State University. 
The MEDLINE system Is an automated on-line service to access medical 
Journals of the previous three year period. The ERIC (Educational 
Research Information Center) system Is an automated batch system 
to access research reports and Journal literature In the field of 
education. These systems are available to the staff and students. 
Interaction has also been Initiated with The Academy for Contemporary 
Problems, an organization formed Jointly by The Ohio State University 
and Battelle Laboratories, Columbus. 

The University has established a unlverslty-centered->lnformatlon 
system. The Information system, called the Mechanized Information 
Center (MIC) operates as a department of the University Libraries. 
MIC has developed a multl-dlsclp Unary batch-mode Information system 
from machine-readable data bases primarily for thi; campus scientific 
communliy* The Interface to the MIC system Is decentralized as much 
as possible through the existing system of twenty-three libraries 
around the campus which serve specialized publics. MIC acquires 
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data bases from comnerctal so'-'^cea, as well &3 from professional 
societies and governmental agencies. Research activities In MIC 
are directed toward improving the services of MIC to its users through 
software refinement and development. This center works closely with 
the staff and students of the Department and the Center. 

The main service now offered by MIC is a computer-based, currant 
awareness system for helping faculty and students keep up with pr/olished 
information, such as; 

1, articles in the current issues of more than 3,200 journals 

2, unclassified government reports included in the most recent 
issue of Gr>vemment Reports Announcements 

3, books cataloged by the Library of Congress 

A. current conference papers presented at professional meetings 
and conferences held throughout the world. 

In one year's time, a person, through his profile, can scan 
approximately 550,000 items, including A00,000 articles, 50,000 
government reports, and 100,000 books. In addition, retrospective 
service (1968 to the present) is available, which covers several 
million journal citations. 

ACADEMIC PROGRAMS IN COMPUTER AND INFORMATION SCIENCE 

The program at The Ohio State University emphasizes education, 
research, and the professional practice and application of computer 
and information science. The department offers undergraduate and 
graduate degrees through the Ph.D. Statistics showing the growth 
of the department are found in Table 1. 

Organization of The Department of Computer and Information Science 

The Department of Computer and Information Science is a separate 
academic unit located administratively in the College of Engineering, 
operating in part as an interdisciplinary program with the cooperation 
of many other departments and colleges throughout the University. 

Object ives of The Department 

The program at The Ohio State University emphasizes education, 
research, and the professional practice and application of computer 
and information science. The educational program offers undergraduate 
and graduate degrees through the Ph.D. The research activities which 
are a central part of the program consist of a broad conceptual base 
supported by a number of contracts and grants as well as by the 
university. The broad core research program and these other research 
tasks interact to form an integrated framework. 
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Undergraduate ProgranB 

Undergraduate degrees In computer and Information science are 
available to students in the College of Engineering, the College of 
Mathematics and Physical Sciences of the College of the Arts and 
Sciences, and the College of Administrative Science. The particular 
program chosen depends upon the student's interests and career 
objectives . 

The undergraduate program in the College of Engineering leads 
to the degree of Bachelor of Science in Computer and Information 
Science. This program is designed for the student who wants to 
specialize in computer and information science from within an engi- 
neering environment. Hence, the program provides the student with a 
core of computer and information science, mathematics, and engineering 
science. Both depth and breadth in computer and information science, 
are assured by specific required course sequences in several areas 
of engineering and science yet, sufficient flexibility exists so that 
a student can elect a portion of his technical course work in order 
to develop his individual interests. 

There are two undergraduate programs in the College of Mathematics 
and Physical Sciences. These programs lead either to the degree of 
Bachelor of Science or the degree of Bachelor of Arts with a major 
in computer and information science. The programs are cast in a 
liberal arts setting and are similar in content. The Bachelor of 
Science program provides a somewhat more technical and thorough 
education in computer and information science and mathematics while 
the Bachelor of Arts program is somewhat more flexible and provides 
an opportunity to relate computer and information science to some 
other dis^cipline. 

The undergraduate program in the College of Administrative 
Science leads to the degree of Bachelor of Science in Business 
Administration with a major in computer and information science. 
This program is designed for the student that is business oriented 
and desires an education in computer and information science and a 
general education in the administrative sciences. The program's 
objective is not to make a computer specialist out of a student, but 
rather to enable him to recognize the opportunities to use the 
computer in his managerial activities, to know what to expect from 
it, and to know how to communicate effectively with computer specialists 
so that computerized projects will be properly handled from a technical 
as well as a managerial point of view. 

Graduate Programs 

The Department of Computer and Information Science offers graduate 
programs leading to both the Master's and Ph.D. degrees. The graduate 
program leading to the Master's Degree is available in seven options. 

Option I for the student desiring a theoretical foundation in 
computer and information science. 
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Option II for the student speclallUng In Information systems. 

Option III for the student specializing in computer system- . 

Option IV for the student specializing in numerical analysis. 

Option V for the student specializing in operations research. 

Option VI for the student specializing in biomedical information 
processing. 

Option VII for the student specializing in administrative science. 

Each of these options provides a background in several aspects 
of computer and information science, as well as additional mathematical 
sophistication appropriate to the student's interest. Each of the 
options may lead to the Doctoral program in computer and Information 
science, and each may be taken with a thesis option or without a 
thesis option. (See Appendix A for a listing of courses by nunter 
and title.) 

All courses of study at the Master's level require completion 
of a core program in computer and information science » together with 
the required courses specified for one of the options and additional 
courses as specified by the student's adviser. The core program includes 
courses on; Principles of Man-Machine Interaction, Numerical Analysis, 
Data Structures, Advanced Computer Programming, Digital Computer 
Organization, Mathematical Froundations of Computer and Information 
Science, Introduction to Linguistic Analysis, Modem Methods of 
Information Storage and Retrieval, and Advanced Seminar in Computer 
and Information Science. 

The graduate program leading to the Doctoral Degree in Computer 
and information Science is flexible in that it is tailored to the 
particular background and interests of the individual student. 
These interests may lie in any one of the research and instructional 
areas already listed as well as in many other cognate areas. A 
cognate field is defined as a field supporting or closely related 
to the fourteen Departmental fields and is ordinarily specified by 
an integrated program of study in other departments of the University. 

Course Offerings 

Currently there are about 81 courses (each one quarter in length) 
offered by the Department, 22 of which are largely undergraduate 
with the remainder being upper level undergraduate and graduate 
courses. In addition to these courses there are over two hundred 
courses offered by a variety of departments of the University which 
are of interest to our graduate students who are encouraged to take 
these courses. 
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Faculty 

The Department of Computer and Information Science has a full 
time faculty of twenty members at the assistant professor level and 
above* They have a wide range of backgrounds and experience. The 
above faculty Is supplemented by staff who have joint appointments 
with other departments; by staff from other departments who teach 
courses primarily for Computer and Information Science students; and 
by adjunct staff people who are employed In off campus organizations 
who teach courses In the Department of Computer and Information 
Science (see Appendix B). There are currently a total of about 14 
supplemental staff In this category. 



INTERACTION WITHIN THE UNIVERSITY 

Both the Research Center and the Department of Computer and 
Information Science Interact with other departments and research 
programs within the University. This Is essential because of the 
multl-disclp Unary nature of the activities encompassed In this 
field. A number of the academic faculty have joint appointments 
In other departments. Staff members of the Department of Computer 
and Information Science have appointments In the following departments 
and organizations: 

Instruction and Research 
Computer Center 
University Systems Computer 
Center 

University Libraries 



a. 


Mathematics 


g> 


b. 


Psychology 


h. 


c. 


Biophysics 


d. 


Electrical Engineering 


1. 


e. 


Accounting 


f. 


Engineering Graphics 





INTERACTION WIIHIN THE INFORMATION SCIENCE COMMUNITY 



Columbus, Ohio Is one of the major centers for Information science 
and for the transfer of Information In the United States. A number 
of or£,anlzatlons are Involved with the activities of computer and 
information science. This affords an opportunity for students and 
faculty to Interact with appropriate personnel in these organizations. 
Some of these are: 



a. 
b. 
c. 
d. 
e. 

f. 



Chemical Abstracts Service 
Bat telle Memorial Institute 
Bell Laboretories 
City National Bank 
Columbus and Southern Ohio 
Electric Company 
Western Electric Corporation 
Rockwell International 



h. 
1. 



j. 
k. 

1. 



Corporation 
Industrial Nucleonics 
State of Ohio Department 
of Finance; Department 
of Highways 

Highlights for Children 
Colun^us Board of Education 
Ohio College Library Center 
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There are a large number of scientists who cone to Coltimbuc 
In order to visit with the Department and Center and who usually 
present a seminar. (The seminars for the period of this report 
are listed In Appendix C.) The people cover virtually all phases 
of computer and Information science. 

In addition, our people Interact at most of the major technical 
meetings In this country as participants giving papers, assisting on 
panels, as attendees, and as officials. Hardly a major technical 
meeting in the appropriate fields is held without a contribution 
from one or more of the personnel from the Ohio State Computer and 
Information Science Research Center. A list of these activities 
can be found in Appendix D. 

Research efforts of the staff are disseminated to the professional 
community through several publication channels. A list of current 
publications of the Research Center staff is included as Appendix E. 
In addition, the Research Center issues a technical report series 
(see Appendix F). 



DOCTOR OF PHILOSOPHY DEGREE 

Tlie Doctor of Philosophy degree was awarded to the following 
students during 1973-74. 

Name Dissertation 

Paul Sui-Yuen Chan An Investigation of Symetric Radix for 

Computer Arithmetic 

Mark Lee Gillenson The Interactive Generation of Facial 

Images on a CRT Using a Heuristic 
Strategy 



Stephen Philip Hepler 



Use of Probabilistic Automata as Models 
of Human Performance 



Paul Tllng Renn Wang Bandwidth Minimization, Reducibilir/ 

Decomposition, and Triangulation of 
Sparse Matrices 



RECENT TECHNICAL REPORTS 

The Computer and Information Science Research Center began 
publishing a technical report series in 1968. A list of recent 
reports of the research center follows (See Appendix F for complete 
list) , 
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BAUM, R. I.; HSIAO, D. K. A data secure computer architecture. 
July, 1974. 22p. (OSU-CISRC-TR-73-10) 

BIERMANN. A. W.; BAUM, R.; KRISHNASWAMY, R.; PETRY, F. E. Automatic 
^o^raisyithesii reports. October, 1973. 62p. (OSU-CISRCTR- 

73-6) 

COHEN. D.: LIU, M. T. Emulation of computer networks by micropro- 

grammable microcomputers. June, 1974. 17p. (OSU-CISRC-TR-74-4) 

HSIAO, D. K. Logical access control mechanisms in computer systems. 
July, 1973. 24p. (OSU-CISRC-TR-73-4) 

HSIAO. D. K.; KERR, D. S.; MC CAULEY III, E. J. A model for data 

secire iystemi (Part I). February, 1974. 39p. (OSU-CISRC-TR- 
73-8) 

HSIAO, D. K.; KERR, D. S.; NEE, C. J. Context protection and consistent 
control in data base systems (Part I). February, 1974. 28p. 
(OSU-CISRC-TR-73-9) 

KIRSCH, B. M. An improved error diagnostics system for System/ 
360-370 assembler program dumps . June, 1974. 60p. ^ObU-LiaKO- 

TR-74-3) 

REAMES, C. C; LIU, M. T. Variable- length message transmission for 
distributed loop computer networks (Part I). June 1974. 9p. 
(OSU-CISRC-TR-74-2) 

RUSH, J. E.; PEPINSKY, H. B.; LANDRY, B. C; MEARA, N. M.; STRONG, S. M.; 
VALLEY, J. A.; YOUNG, C. E. A computer assisted language analysis 
system. February, 1974. 71p. (OSU-CISRC-TR-74-1) 

WANG, P. T. R. Bandwidth minimization, reducibility decomposition, and 
'triangularization of sparse matrices. September 197'» 162p. 
(OSU-CISRC-TR-73-5) 

WYRICK, T.: HSIAO, D. K. System configuration study on lie interconnection 
of the IBM 370/165, DEC System- 10 and MICRO 1600/21 computer systems. 
October, 1973. 26p. (CSU-CISRC-TR-73-7) 
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II. INFORMATION STORAGE AND RETRIEVAL 



ALGORITHMIC CONTROL OF INDEX TERM SPECIFICITY IN DOUBLE-KWIC COORDINATE 
INDEXING SYSTEMS 

The specificity and nuaiber of index terms covering a particular 
area of subject matter in an index is generally a function of the 
number of articles in that subject area - i.e., the greater the number 
of articles, the greater the specificity and number of index terms 
covering those articles in the index. Algorithms for control of 
index term specificity have been developed and utilized in our most 
recent version of the Double-KUIC Coordinate Indexing System. The 
algorithm which has been programmed utilizes statistical data and 
user specifiable control paramters to select index terms of appro- 
priate specificity from program-generated sets of hierarchically- 
related index phrases having common words as parent nodes in the 
hierarchical trees. The current system is being evaluated in con- 
Junction with other fundamental studies on iiq>roveroent of automatic 
indexing. 

A. E. Fetrarca, W. M. Lay, W. S. Stalcup. (Sponsor: National Science 
Foundation GN 534.1, and GN 27458) 



AUTOMATIC INDEXING OF OUTPUTS FROM RETROSPECTIVE SEARCHING OF COMPUTER- 
READABLE FILES 

Interface programs have been written to provide KWIC and Double- 
KWIC indexes of outputs from retrospective searches of the Ohio 
State University Mechanized Information Center (MIC) data bases. 
This capability, in addition to providing a potentially useful 
service to users of the MIC retrieval system, will be useful for 
ongoing studies of automatic vocabulary control in printed indexes 
derived fiom natural language text and will also be useful for 
contemplated evaluative studies of the MIC retrieval system. 

A. E. Fetrarca, J. H. Hsu. (Sponsor: National Science Foundation. 
GN 27458) 
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AUTOMATIC INFORMATION SYSTEMS AI^ALYSIS THROUGH EVALUATION OF 3IMUUTED 
PERFOBMANCE 

The Auxiliary Storage Simulator (AUXSYM) provides execution 
time commands for between simulation modification. An existing 
information system model can be modified by (1) substituting one 
facility for another, (2) equating idependently defined facilities, 
(3) restructuring the model configuration, and (A) redefining the 
data base component, AUXSYM also provides, as output, a compre- 
hensive set of utilization and queueing statistics for each information 
system resource. The current research uses these modification commands 
and statistics as a basis for fine tuning an original model toi^ards 
optimum system performance, A definition of optimal, or satisficable 
optimal, performance, and appropriate measures must be determined. 
The automatic analysis phase of AUXSYM uses multi-objective mathemat- 
ical programming techniques to evaluate the output of a simulation 
against modeler defined performance criteria. Heuristic procedures 
then decide which modification commands are appropriate to drive the 
model towards optimality 

T, DeLutis, J, S, Chandler, (Sponsor: National Science Foundation, 
GN 36622) 



AUTOMATIC VOCABULARY CONTROL IN PRINTED INDEXES GENERATED FROM NATURAL- 
LANGUAGE TEXT 

Despite the degree of sophistication attained in automatic indexing, 
much remains to be done in the area of automatic vocabulary control 
to minimize the amount of concept (or subject) scattering encountered 
in the indexes so produced. Such scattering is found, for example, in 
the alphamerically ordered set of keywords mill, millenia , miller , 
mil lime ter » million , and mills , where three words related to the 
concept of "mill" are scattered by intervening, unrelated entries. 

Automatic vocabulary control, like manual control, has here- 
tofore been attained through use of r thesaurus (dictionary) appropriate 
to the subject area covered by the index. However, the thesaurus 
approach suffers from problems relating to the size and coverage of 
the dictionaries involved. That is, a thesaurus is often either too 
small for adequate vocabulary control, or too large to conveniently 
store, maintain, or even use efficiently. These problems can be 
significantly reduced by a vocabulary control scheme which is pre- 
dominantly algorithmic rather than dictionary-driven, 

Algori hmlc control can be achieved by use of an appropriate 
stemming algorithm to identify conceptually related words on the 
basis of a common stem, followed by an appropriate recoding algorithm 
to construct the preferred index term from the stem and a suitable 
suffix, if necessary* We have used such an approach to a limited 
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extent to eliminate scattering caused by singular and plural words 
in Double-*KUIC Coordinate Indexes. A number of stewnlttg-recoding 
algorithms are presently being investigated for extension of the 
algorithmic approach to other variant word forms. 

A. E. Petrarca, W. S. Stalcup. (Sponsor; National Science Foundation. 
GN 27458) 



DECISION THEORETIC APPROACH TO AUTOKATIC DOCUMENT CLASSIFICATION 

Given a set of classes and a set of documents to be classified 
into these classes, the sequential algorithm obtains keywords for a 
document and after each keyword computes a new set of aposteriori 
probabilities of the classes. Classes whose probabilities do not 
exceed a given threshold are dropped from consideration. This step 
is repeated until there is either Just one class remaining or the 
entire document has been read. If there is Just one class remaining, 
the document is classified into that class. 

This research attempts to develop a mathematical model of the 
sequential algorithm. Current areas of research are: 

1. development of a decision- theoretic tree to model the process; 

2. development of analytical criteria for dropping classes at 
each stage; 

3. development of a method which will separate the **noise** 
keywords from the good ones; 

4. development of analytical criteria for classification into 
multiple classes. 

Ii. J. White, G. Kar (Sponsor: National Science Foundation, Office 
of Science Information Service. GN 36340) 



INFORMATION PROCESSING SYSTEMS SIMULATION 

The goal of the research is the development of a methodology 
for the performance evaluation of information systems. Initially 
the research Is being restricted to the analysis of the auxiliary 
storage subsystem within the total system, and is an expansion of 
a methodology for auxiliary storage analysis developed by the principal 
investigator. The research is divided into two major items, (a) 
continued verification and extension of the features comprising the 
current system, and (b) formulation of systematic procedures, based 
on a set of performance measures, for design modification and acceptance. 

Currently, the methodology is a set of general design procedures 
capable of evaluating auxiliary storage system designs prior to their 
implementation. It is predicated on the concept of macro- analytic 
design evaluation, that is, the comparative evaluation of alternate 
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syateoB to identify critical design variables. It represents a 
unique capability because it conbines both hardware and software 
^nsiderations in the evaluation process. Further, it allows for the 
analysis of stable systens operating in a batch environment to 
Sna-ic system, requiring real time operations. ^Jtem f) will i«Prove 
thesi capabilities and increase the scope of applicability of the 
methodology. 

The procedures being developed under item (b) are based on the 
performance measures provided via model evaluation. Design modification 
will involve changing system variables for the purpose of improving 
system performance with respect to imposed time and/or cost constraints. 
Design acceptance will deal with the variances in system behavior 
under changes in system parameters, particularly the demands placed 
on the system and the information structure characteristics of the 
data base. 

T. DeLutis. (Sponsor: National Science Foundation. GN 36622) 



N-GRAM KEYTERMS IN AUTOMATIC DOCUMENT CLASSIFICATION 

An n-gram is a string of n characters which can be used as a 
keyterm in automatic document classification or retrieval. Blanks 
may occur within the body of an n-gram; therefore it is possible 
for single n-gram keyterms to span work boundaries and thus incor- 
porate word dependencies within a phrase. Since n-grama overlap 
greatly and the too-frequent ones are assigned little weight, n-grams 
compress storage of keyterms and require no stoplist or stemming 
algorithm. A simple counting operation on neighboring n-grams can 
simulate a parsing routine to equate phrases which differ syntactically 
but are semantically the same. The present research will investigate 
these advantages and compare the relative effectiveness of n-gram 
keyterms to keywords over various automatic classification methods 
and data bases. Evaluation criteria will be storage and time con- 
siderations as well as classification accuracy. 

S. E. James 



PERFORMANCE EVALUATION OF DATA BASE MANAGEMENT SYSTEMS 

Tlie data base characterization facilities of the Auxiliary 
Storage System Simulator (AUXSYM) permit the analyst to view a data 
base in two ways: (1) in terms of the logical organizations of its 
data sets, and (2) in terms of its physical storage organization. 
A data base is characterized by specifying (1) the logical organizations 
of its data sets, (2) its areas of secondary storage (physical address 
spaces), and v ' logical address spaces for its data sets, together 
with a mapping from logical address spaces to physical address spaces. 



15 



The logical organization of a data set is described in terM of the 
file organisation and thfc type of file access. Data base characteri- 
zation allows the analyst to change the nappings between data set 
logical address spaces and physical address spaces and the napping 
of physical address spaces to mounted volumes at simulation tine. 
These characteriztion facilities are being used in conjunction with 
current research on the performance evaluation of data base management 
systems. These facilities are being used in the development of a 
methodology to evaluate alternative Data Base Management System (DBMS) 
features. Input to this methodology are DBMS characterizations. 
Outputs include data set definitions and procedures which are sub- 
sequently incorporated in an AUXSYM model of the particular information 
system being evaluated. 

T. DeLutis, J. Aitken. (Sponsor: National Science Foundation. GN 36622) 



A QUANTITATIVE MEASURE OF CONCEPT SCATTERING FOR EVALUATION OF 
AUTOMATIC VOCABULARY CONTROL TECHNIQUES 

The purpose of introducing automatic vocabulary control in 
printed keyword indexes generated by coiiq>uter from natural- language 
text is to reduce the amount of concept scattering that would other- 
wise be found in such keyword indexes. In order to evaluate the 
effectiveness of any vocabulary control technique In this regard, 
it is desirable to have an objective and quantitative measure of 
concept scattering encountered in keyword indexes subjected to 
varying degrees of vocabulary control. 

Since current indexing theory does not provide any guidelines 
on how to quantify scattering in an index, a scattering measure 
based on analogies between indexing systems and thermodynamic systems 
is proposed. Such analogies apply because an index, like matter at 
given conditions, may be described in terms of the amount of order 
in its component parts (i.e., concepts in an index; molecules in matter). 
For example, in a keyword index produced without vocabulary control 
from natural language text, each entry is sorted (ordered) alphameri- 
cally on its keywords. However, a certain amount of concept scattering 
(i.e., concept disorder) is inevitably also present in such an index. 
That is, although the words (or phrases) have been ordered, the 
concepts which they represent may still be partially disordered. 
This may be seen, for example, in the alphamerically ordered set of 
keywords ski, skid , skidded , skidding , and skied, where the first 
and last keywords refer to the same concept, "skiing", but they are 
separated by three keywords unrelated to that concept. 

Extending the thermodynamic analogy further, the concept scatter- 
ing of an index may be informally defined as concept entropy. Just 
as work expended by a refrigeration unit reduces entropy present in 
an insulated system, work done by a vocabulary control process reduces 
concept entropy present in an index. Furthermore, "absolute zero" 
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l8 meaningful In an indexing system because, in direct correspondence 
to themo dynamics, when no more concept scattisring can theoretically 
be removed from an index, there remains an "ideal index" with zero 
concept entiropy. Thus, the amount of concept scattering present 
within an index can be computed by determining the extent to which 
that index departs from its ideal index. The particular equations 
and units for best quantifying scattering measurements are being 
investigated along with the development of ideal indexes to be used 
for these measurements. 

A. E. Petrarca, W. S. Stalcup. (Sponsor: National Science Foundation 
GN 27458) 



A SEQUENTIAL APPROACH TO AUTOMATIC DOCUMENT CLASSIFICATION 

This research represents a study of a sequential approach to 
automatic document classification. A sequential method has the 
advantage of requiring substantially less computation time than 
other classification techniques, since the entire document to be 
classilied need not be scanned. The classification accuracy appears 
to be comparable to other automatic classification methods, as 75 
to 871 accuracy on the American Institute of Physics' SPIN abstracts 
has been achieved using the sequential method. The variation of 
algorithm parameters has been shown to control accuracy and computation 
time. Automatic keyword selection techniques have been developed 
based on frequency of occurrence criteria. A clustering algorithm 
has been used to generate and check classifications for sample docu- 
ments based on an analytical measurement of the "distance" between 
two classifications. Cross- comparisons between the discrimination 
method of John H. Williams and the sequential method are presently 
being conducted to evaluate the comparative speed, storage requirements, 
and accuracy of each method. A mathematical sequential model is also 
under development which will address the problems of multiple class 
memberships and of a "general class" for tinclassifiable documents. 
A number of data bases have been acquired for further tests which will 
establish the stability of the sequential method in various environments. 

L. J. White, S. E. James, G. P. Jones, G. Kar, D. E. Westbrook 
(Sponsor: National Science Foundation, Office of Science Information 
Service. GN 36340) 
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III. HUMAN INFORMATION PROCESSHG 



HUMAN EXTRAPOLATION OF STRINGS GENERATED BY ORDERED CYCLIC FINITE 
STATE GRAMMARS 

The investigation of huaan behavior in relation to the complexity 
of sequential concepts can be decomposed into the devising of a measure 
of performance and a measure of complexity for sequential concepts. 
In this study ^ learning was measured by errors made in extrapolating 
character strings. The strings used were ordered by structural 
paraikuiters of the grammars used to generate the strings. These were 
the size of the terminal alphabet and the level of ec^edding (chain 
length) of the generating grammars. 

Two experiments were performed using a coiq>uterized implementation 
of a string extrapolation task. In Experiment 1, it was four^d r.hat 
differences in the size of the terminal alphabet and the level .^f 
embedding of the generating grammars produced statistically significant 
differences in the number of trials to last error^ and the number of 
errors made in string extrapolation. In Experiment 2, it was found 
that variations in the chain length of the generating grammars produced 
statistically significant differences in the number of errors made in 
extrapolation, corrected for trials where there was insufficient 
information to correctly continue the strings. Finally, for all 
grammars with 12 characters in their terminal alphabets, a correlation 
of 0.695 was found between the corrected number of errors and the 
chain length of the generating grammars, 

J. L, Beugi R, Ernst 



USE OF PROBABILISTIC AUTOMATA AS MODELS OF HUMAN PERFORMANCE 

This research studied the use of probabilistic automata as models 
of human behavior in a particular task. Subjects were shown repeated 
brief exposures of an Item array and then asked to reproduce the 
array. Exposure duration and number of presentations before reporting 
were varied and were both found to significantly affect the number 
of items correctly reported. 
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Two models were developed that fit the data closely. Two con- 
clusions vere drawn from the first of two models. While subjects 
probably paired items in the stimulu?, recall of the stimulus was 
one item at a time. Also» changes in the nunber of items subjects 
were able to report from one condition to the next were studied with 
the model. In many cases, changes in performance were attributed to 
a change in the probability of acquiring one particular item. 

Two conclusions were also drawn from the second model. The 
probability estimates indicated some grouping of items for study by 
subjects. The effect of the number of items previously acquired on 
the probability of acquiring an additional item was also studied. 
The results showed the number of items previously acquired to have 
a strong effect on this probability. 



S. P. riepler, R. Ernst 
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lY. INFORMATION ANALYSIS 



A GENERAL THEORY OF INFORMATION FLOW AND ANALYSIS 

A generalized framework foi the development of a theory of 
Information flow which permits the analysis and quantification of 
Information has been suggested. In addition to its theoretical and 
conceptual interest, there are major and immediate implications 
for the developr^ent of information systems and networks as well as 
for the general understanding of Information flow, retrieval, and 
transfer. 

Numerous intuitive notions exist about the Interrelationship 
between information and decision-making. At the pragmatic level. 
Information has value to the extent that it is useful as a resource 
for purposeful activity. The primary "purposeful activity" in life 
is decision-making. Hence, information and decision-making are 
inextricably tied together. In our formulation, in fact, information 
is defined in just that way, as being data of value of decision- 
making. 

As a consequence, a measure of the amount of information in a 
data set or message is defined in terms of a quantity called the 
decision state of a deoision'maker*. The decision state is a function 
of the determinism of the decision-maker. We suggest a way ot 
evaluating the decision state quantitatively in terms of a specific 
decision rule. 

We assume that the decision-maker selects probabilities of choice 
for each alternative available that are proportional to the expected 
value of each alternative. The value of the deoisior. state is 
then defined as the summation of the expected values of all the 
possible courses of action we'.ghted by the probability of each 
course of action. A new measure for the information oontained in 
a parttcfular deoieion state is developed: 

I'm I iPia.)}^ - 2, 
i^l ^ 

where m is the number of courses of action available to a decision- 
maker and P(a^) is the probability of each course of action. The 
information is defined in terms of a two-choice deterministic situation 
which we call a "binary choice unit". This measure is universally 
applicable for all information that is concerned with the effectiveness 
of the data upon the recipient. 
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A measure of the amount of information in a data set or message 
can bfe arrived at by computing the difference In the amount of 
Infortiatlon in the decision state before and after receipt of the 
data. That is, the amount of information is arrived at by considering 
the Impact this new data has on the decision-maker's decision state. 
In symbolic terms, 1(D), the amotmt of information in data set D, is 

m . - 1,, 



Where I^^j and are the amounts of information in the decision state 
after and before receipt of the data set. 

The suggested information measure leads to a -aeasure of the 
pragmatic information content of a data set for a parHoular decision- 
maker at a parHoutar point in time. The data acquired, processed, 
stored, and disseminated by an information system may be used, however, 
as a resource by vaHous decision-makers at various points In time. 
Hence, in the design and development of information systems, there 
exists a problem whose level of complexity is an order of magnitude 
above that of the primary problems addressed in this study ~ the 
problem of • quantifying the information contained in a data set in 
terms of its overall usefulness for a range of deotsion-makers over 
a period of time* 

One possible approach to this problem of assigning a number to 
a data set to indicate its composite information content would be 
to start by dptermining the relationship between tht effectiveness 
of a decision-maker. Since what is really desired is some indication 
of the information content of this data set for thi s deci sion-maker 
over a period of time, one may determine some index 1(D) of the 
average information contained in data set D over some period of time. 
Then, if one were to develop a measure of the effectiveness of each 
of the decision-makers for whom this data set serves as a resource, 
it would be possible to formulate an information profile for the 
data set. Such an information profile would indicate the average 
information content of a data set as a function of declslon-maker- 
effectlveness. 

If such a profile could be determined for every data set to be 
stored in an information system, then some number derived from this 
profile could serve as an index of the composite value of this data 
set. This method would be of major Importance for the development 
of a sound procedure for the design of more effective information 
sys terns . 

M. C. Yovlts, J. G. Abllock (Sponsor: National Science Foundation. 

GN 31628) 
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Y. LINGUISTIC ANALYSIS 



PARALLEL RECOGNITION OF FORMAL LANGUAGES 

Cellular Automata and Cellular Logic Syatena are studied In this 
work as parallel recognition devices for formal languages. It Is 
the goal of the project to develop descriptions of the classes of 
languages recognizable "Immediately"' (l«e«; In time essentially 
Independent of the length of the Input string). The classes are 
differentiated by the ''amount of hardware" or quantity of cellular 
space required I as a function of strlng^length. 

For a string X of length n In a regular language R| n modules 
of cellular space are necessary and sufficient. For X of length n 
In a context-free language C| n^ modules are believed sufficient. 
It Is not known If n,^ will be necessary. Work continues on this 
problem. 

Further work will consider recognition cf context-sensitive and 
deterministic context-free languages. 

J. Rothsteln» M. Moshell 



ON THE SYNTACTIC STRUCTURES OF UNRESTRICTED GRAMMARS 

Formal definitions for the syntactic structures of unrestricted 
grammars have been developed. The traditional forms for grammar 
productions give rise to "derivation structures" and "phrase structures", 
two distinct notions of syntactic structure which become Indistinguish- 
able from trees In the context free case. Parallel theories have 
been developed for both kinds of structure. Using the accepted 
definition of structural equivalence for unrestricted derivations. 
It has been proven that two derivations are structurally equivalent 
If and only If they have the same syntactic structure. Structural 
equivalence Is an equivalence r^^latlon over the derivations of a 
grammar, and each equivalence class contains a unique leftmost derivation. 
The relationship between derivation structures and phrase structures 
has been atudied and It has been shown that the two concepts are non- 
isomorphic. 

H. W. Buttelmann 
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GENERALIZED FINITE AUTOMATA ON UNRESTRICTED SYNTACTIC STRUCTURES 



The syntactic structures of unrestricted grsnmars fall Into 
two non-lsomorphlc classes: "phrase structures" and "derivation 
structures". A generallasatlon of finite automata to accept these 
structures has been defined and studied. Corresponding to the Chomsky 
hierarchy of grammars, there Is a hierarchy of acceptors, and for 
both kinds of structures, the type 2 acceptors are tree automata. 
For each type In the hierarchy, the sets of structures recognized 
by the acceptors are just the sets of projections of the structures 
of the grammars, and the languages of the acceptors aro just the 
languages of the grammars. Finally, the set of derivation struccures 
of a recursively enumerable language is recursive. 



U. W. Buttelmann 

A FORMAL THEORY OF THE SYNTAX, SEMANTICS AND TRANSLATION OF PHRASE- 
STRUCTURE LANGUAGES 

A formal definition for a semantics for phrase structure grammars, 
called a phrase structure semantics, has been developed. It is a model 
of the following semantic philosophy: 1) it f ^^^.^-^J,™""^" 

Ing, and 2) the meaning of a phrase is a function syntactic 
stricture and of the meanings of its constituents. A pair (G,S) where 
G is a phrase structure grammar and S is a phrase structure semantics, 
is called a phrase structure language description. The languages of 
psld's are just the recursively enumerable sets. It has been shown 
that for any psld with a type 0 or type 1 grammar, there exists a ps id 
with a context free grammar that defines the same language, with the 
same meanings. Translation is defined on the languages of psld s, 
and it has been shown that the translation function on the languages 
of arbitrary pairs of psld's ib effectively computable. 



H. W. Buttelmann 



SYNTAX-DIRECTED AND SE^JVNTIC-DIRECTED TRANSLATION OF PHRASE STRUCTURE 
LANGUAGES 

The basis of this theory is a model, from ealler work, of language 
description called a phrase structure language description, which 
contains both syntax and semantic information. Certain translation 
algorithms for these languages are being studied, which promise to be 
more efficient than the general algorithm of earlier theory. These 
algorithms are either syntax-directed or semm tic-directed, depending 
on whether they are controlled by syntax Information exclusively, 
or by both syntax and semantic information. A part'cularly fast 
translation algorithm is a special case of the synt. t-dlrected trans- 
lations, where the syntax control information can be specified in a 
simple, finite way. It has been shown that it is possible to use a 
semantic-directed translation scheme as a translator generator, to 
produce the finite specifications of a very fast syntax-directed 
translation, if such a translation exists. 
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Running computer programn of the translator and translator- 
generator have been developed) and the programs are being used to 
develop and test definitions and translations of very simple languages. 

H. W. Buttelmann 



SEMANTIC-DIRECTED TRANSLATION— ANOTHER APPROACH 

Let L be a language and U a universe of discourse. A semantics 
for L Is a specification of a function f: L 2^, which assigns to 
each sentence w of L one or more meanings f(w) In U. We are Investi- 
gating methods for specifying the semantics of context free languages 
so that a partial function f"^: 2^ L exists and Is effectively 
computable under certain conditions. Then a translation t from Li to 
L2 can be effectively computed as t(w) ■ f2" (fi(w)). 

H. W. Buttelmann, A. Pyster 



STUDIES IN DECIDABILITY, EXISTENCE. AND EFFECTIVENESS OF TRANSLATIONS 
ON PHRASE-STRUCTURE LANGUAGES 

A theoretical model for the semantics of phrase structure languages 
is taker from earlier research. The phrase structures of sentences are 
used to compute semantic functions. The model Is applied to a theory of 
translation between phrase structure languages. Issues of the decidability 
of the existence of a translation and effectiveness of a translation are 
studied. Various types of translation are described under different 
constraints, and decidability and effective computablllty Issues are 
examined for each type. Necessary and sufficient conditions for existence 
for various types of translation are being studied. 

H. W. Buttelmann, R. Krishnaswamy 



SOME PROPERTIES OF SYNTAX-DIRECTED TRANSLATIONS 

We are investigating problems relating to the existence and 

computability of finitely specified syntax-directed translations. 

The properties of the set of syntax trees generated by a finite 

generating set of trees, and maps on those sets of trees, are being 

studied, and we are Investigating the following conjecture: Let 

L^ be a phrase-structure language of numbers written in radix a and 

a phrase-structure language of numbers written in radix b. Then 

^J\^}-^^Y specified syntax-directed translation from L to L. exists 
if loba° is rational. a 



H. W. Buttelmann, F. J. Dickey 



24 



THE ROLE OF THE NONTERMINAL IN LANGUAGE GENERATION 

We examine in depth the dlfferencei between gremauirs which allow 
nonterminals and thoee which do not. We do so by Introducing a 
terminal-only grammar » which Is a rewriting eye tern in which non- 
terminals in their usual capacity and status do not exist. 

A hierarchy of grajmnars and languages, parallel to that of 
Chomsky's, is formed and investigated. The properties of the terminal- 
only hierarchy are markedly different from those of the Chomsky 
hierarchy and many results on these differences are proven. In 
addition, we discuss some of the functions of nonterminals and why 
not having them in terminal-only grammars drastically affects the 
generated languages. 

H. W. Buttelmann, A. Pyster 
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VI. ARTIFICIAL INTELLIGENCE 



DECISION THEORETIC MODELS FOR INVARIANT PATTERN RECOGNITION 

The problem of statistical classifier design when pattern 
vectors depend upon variables $ such as orlentatloni external to 
the feature space Is Investigated. The problem arises In designing 
an automatic Identification system for aircraft using low frequency 
radar data. In such a situation It Is desirable for the feature 
selection and decision algorithms to be either Independent of the 
external variables or to utilize any available Information regarding 
them. A nlmlnum error rate has been obtained tislng discriminant 
functions based on statistical averaging over the external variables. 
Eff:lclent methods have been developed for computing Bayes discriminant 
functions when class- conditional probability density functions are 
either mixtures of multivariate normal or Rlclan density functions. 
Treating each discrete orientation as a sub-class , an efficient 
sequential technique based on the generalized sequential probability 
ratio test has been obtained and tested in Monte-Carlo simulations. 
The performance of the sequential classifier has been compared to 
the theoretical performance of a classifier based on majority voting 
over individual decisions. Other sequential and non-parametric 
techniques I and classifier design when samples correspond to different 
orientations are currently being investigated. 

L. J. White, S. N. Srihari (Sponsor: Air Force Office of Science Research 
Grant 69-1710) 



DECISION THEORY FOR PATTERN RECOGNITION 

If p(x) and q(x) are the densities for the N-dimensional measurement 
vector X| conditioned on the classes ci and C2$ and if finite sets of 
samples from the two classes are availablei then a decision function 
based on estimates p(x) and q(x) can be used to classify future 
observations. In general i however , when the measurement complexity 
(the dimensionality I N) is increased arbitrarily and the sets of 
training samples remain finite , a ^^peaking phenomenon** of the following 
kind is observed: classification accuracy improves at firsti peaks at 
a finite value of called the optimum measurement complexity, and 
starts deteriorating thereafter. In this research, we have derived, 
for the case of statistically independent measurements, general conditions 
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under which It can be guaranteed that the peaking phenomenon will not 
occur, and the correct classification probflbJllty will keep Increasing 
to value unity as Vh***. Several applications are considered which 
together Indicate that, contrary to general belief, Independence of 
neasurenents alone does not guarantee the absence of the peaking 
phenomenon . 

B. Chandrasekaran, A. K. Jain (Sponsor: Air Force Office of Scientific 
Research. 722351) 

DISTANCE FUNCTIONS FOR INDEPENDENT MEASUREMENTS AND FINITE SAMPLE SIZE 

It Is generally known that the "further apart" one makes the 
class-condltloual densities, the smaller will be the probability of 
mlsclasslflcatlon determined. The distance functions have also been 
found to be quite useful to the recognition system designer In selecting 
the best subset of features from a given set. Most of the distance 
functions are applicable only when the class-conditional densities are 
known; how these will behave If the estimated densities are substituted 
for the true ones Is generally uncertain. In this research, a distance 
function has been derived when the features are Independent and only 
a finite number of samples Is available to estimate the class-conditional 
densities. 

B. Chandrasekaran, A. K. Jain (Sponsor: Air Force Office of Scientific 
Research. 722351) 

FINITE MEMORY DECISION THEORY 

In this research we Investigate finite memory, deterministic 
algorithms for the symmetric hypothesis testing problem. It Is 
shown how members of a class of Irreducible, deterministic, finite 
memory algorithms can be constructed to give a steady state probability 
of error that decreases asympathlcally faster In the number of states 
than the bebt previously known deterministic algorithm. It is strongly 
believed that the optimal algorithm lies In this class. 

B. Chandrasekaran, C. C. Lam 



GRAMMATICAL INFERENCE PROBLEM 

The problem of grammatical Inference for finite state grammars 
can be approached analytically, and has application in syntactic 
methods for pattern recognition. The current project is implementation 
of a technique proposed by Pao in 1969 which, given a set of strings 
from the target language L, will first construct a machine Mq to accept 
exactly those strings. Next the algorithm constructs a lattice of 
all tho^e machines which can be produced from Mq by merging its states. 
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The algorithm then systematically searches the lattice to find the 
target machine by examining pairs of machines Hi and Using a 
string X accepted by exactly one of Mj, M2 and an oracle to determine 
whether x e one of M^, M2 Is eliminated from consideration and the 
other maintained In the lattice. Work has been done on the storage 
of this latlce and on the construction of a minimum Mq. Currently 
the problem of defining an efficient ''systematic'' search of the lattice 
Is being examined as well as the problem of finding a string x for 
the test between M^^ and M2. Future research plans are to extend the 
solution to context-free grammars 1 and to apply these results to 
patteru recognition problems. 

L. J. White, D. A. Marlk (Sponsor: Air Force Office of Science Research. 
Grant 72-2351) 



INTERACTIVE GENERATION OF A HUMAN FACIAL IMAGE ON A CRT USING A 
HEURISTIC STRATEGY 

Sketching a human face Is a task which Involves spatial decisions 
and a knowledge of aspects of the face that are Important in recognition 
These are talents which non*- artists lack. A man'-machlne system has been 
developed with which a non-artist can create, on a graphic llsplay, any 
male Caucasian facial Image from a photograph In front of him. The 
computer system contains pre-stored facial features, an average face 
used as a starting point, and a heuristic strategy which provides the 
user with the "how-to-do-lt" knowledge necessary. The user makes all 
of the visual decisions and can make changes in individual features or 
hierarchically determined seta of features through analog input devices. 

B. Chandrasekaran, M. L. Gillenson (Sponsor: National Science Foundation 
ON 534. 1 and GJ A1A22X) 



AN INTERACTIVE SYSTEM FOR AUTOMATIC PROGRAM SYNTHESIS 

A man-machine interactive computer programming system has been 
developed which enables a person to work through example calculations 
using a light pen at a CRT and which then automatically synthesizes 
a program for doing those calculations. The system enables the user 
to "show"' a computer how to do a calculation rather than having to 
write a program in the traditional manner. This system, called an 
autoprogrammer, works by storing the complete sequence of actions 
executed during an example computation and then synthesizing the 
smallest computer program which is capable of executing the example. 

In order to reduce the synthesis time, the program synthesis 
algorithm utilizes a number of techniques including preprocessing 
the trace infoimation to reduce enumeration and pruning the search 
using a failure memory. These have proven sufficiently powerful 
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to do complex synthesis problems, for example, a universal Turing 
machine, in less than a minute. The autoprogrammer system has been 
successfully used to produce many programs from interactive calculations 
including a finite state machine minimiaer and a small compiler. A 
ten minute color movie has been made to demonstrate the operation of 
the autoprogrammer system. Work is currently in progress to explore 
the problem of synthesis from weaker input information, in particular 
from traces of memory snapshots. 

A. W. Biermann, R. Krishnaswamy, F. Petry, B. Chandrasekaran. 
(Sponsor: National Science Foundation. GN 534.1 and GJ 3A739X) 



LOW-FREQUENCY RADAR AIRCRAFT DETECTION AND aASSIFICATION 

Previous research has demonstrated the feasibility of using 
multiple low-frequency radar returns for target classification. 
Simple object shapes have been successfully classified by linear 
techniques, but aircraft data poses greater difficulty, as in 
general such data are not linearly separable. Since two parameters 
specify the aspect angle, this data lies on a 2-dimensional surface 
in n-space, where n is the number of frequencies utiliied. A 
bilinear approximation of this surface has provided interpolation 
capability, improved proximity information, and an intersection 
algorithm which determines whether the aircraft data is separable. 
Improved separability together with decreased susceptibility to 
additive noise was obtained by mixing horizontal and vertical 
polarisation data at an optimum polarization angle. Studies have 
indicated the importance of a phenomenon known as bias, where in the 
identification of a test signal as one of two given aircraft, the 
probability of misclassifi cation for one aircraft can be substantially 
larger for one aircraft than the other for high noise levels. Thus 
classification algorithms are currently being developed which jointly 
minimiae the probability of misclassification and the bias effect. 
Another approach being investigated is multipoint classification, 
where the input is a sequence of independent radar measurements of 
the aircraft target to be identified. It has been shown that this 
approach can substantially reduce the effects of noise even for a 
modest sample slae, and a number of classifiers are being investigated 
for this purpose. 

L. J. White (Sponsor: Air Force Office of Science Research. Grant 
69-1710) 



NUMBER TREES, SEMIGROUPS, AND FORMAL LANGUAGES 

Any function f with integer values, defined on all the integers, 
where the value does not exceed the argument, defines a set of trees 
whose nodes correspond biuniquely to the integers. The trees are 
digraphs whose paths lead to "roots", whose Integer labels are 
given by f (n) - n (strictly speaking the tree part of uie graph^is 
obtained by suppressing self-loops, which occur only for such n s). 
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A aet of such functions defines a semigroup of tranaformatlona on 
the Integers I with composition of functions giving trees formed by 
concatenating "tree paths" from s\ib-trees associated with Individual 
functions « An interesting function i whose complexities seem Inex- 
haustlblsi Is provided by Euler^s totlent function, <p(n). All odd 
Integers are "leaves" In the sense that there Is no n such that <p(n) 
Is an odd Integer > !• If p Is a prime and 2p4*l Is not prime thlia 
2p Is a leaf. The "main stem" or trunk of the (^tree consists of 
powers of 2,(<p(2n) * 2^"^) In the sense that the Interated <p-functlon 
carries all Integers Into the trunk (example; 2^3^ joins the trunk 
at 2^ after n Iterations), Vxe number of prlidtlve roots of any 
prime p are given by <p((p(p))« Formal languages can be encoded as 
sets of Integers (e.g. by the m-adlc or other notations) i and gener- 
ative granmarsi Post systems i and Markov algorithms can all be viewed 
as "growing a'nuniber tree from a root", the root being the Integer 
associated with the start symboli axiom, or Input string respectively • 
The number^tree functlo^^s {f}, defined "Inversely" by the production 
or rewriting rules i are then "parsing" operational and a tree-path 
to the root Is equivalent to a derivation i proof i or computation In 
the corresponding formal system or language. As numbers In these 
encodings are vastly smaller than Goedel numbers » they may be useful 
In mechanizing discoveries of proofs In artificial Intelligence i 
In decision problems i or In parsing problems, 

J, Roths te In 



30 



. INFORMATION PROCESSES IN PHYSICAL, BIOLOGICAL AND SOCIAL SYSTEMS 



ON THE ROLE OF POST HOC ERGO PROPTER HOC IN SCIENTIFIC METHODOLOGY 
AND ARTIFICIAL INTELLIGENCE 

The literatures of pattern recognition, artificial intelligence 
and scientific methodology give so much prominence to decision theory, 
statistical or logical, that the need for generating hypotheses from 
which the choices are to be made is often obscured. Work so far on 
trying to discover the mysterious we lisp rings of creativity whence 
come hypotheses, in the sense of seeing how close one can get to a 
formal discovery procedure (probably impossible to do, but this does 
not mean that one can' t program much of the drudgery) , has led to 
the following. The "universe of interest" has a '•history", a des crib- 
able aspect of which is to be singled out as an "initial condition , 
"cause", "specification", "state" or the like, some other aspect 
being the "later condition", "effect", "subsequent state" etc. What 
is sought is a "dynamical law" or the like permitting one to deduce 
the second class of aspects from the first, thereby reducing the 
(empirical) information needed to write a history. Post hoc ergo 
propter hoc (usually dismissed with contempt as a logical fallacy) 
can be understood as a generator of a set of hy potheses which must 
undergo further winnowing (e.g.; by Bayesian inference from new 
experiments). It then becomes a useful tool (instead of a snare and 
a delusion) for iterated experimentation. Creativity comes in at 
the stage of designing new experiments. 



J. Roths tein 
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VIII. MATHEMATICAL TECHNIQUES 



AN ALGORITHM FOR SUB-OPTIMAL SET COVERS WITH CONFIDENCE MEASURES 
OF OPTIMALITY 

An effective heuristic has been developed for the minimum set 
cover problem: mln cx such that Ax^l|X"Oorl|A binary. IWo 
algorithms have been developed: one for the general costs problem 
(arbitrary c vector) ^ and one for the cardinality problem (all c^ 
equal). Both algorithms are Implemented In computer programs In 
FORTRAN and assembler. Test results for the cardinality problem 
exhibit very good performance on test problems from the literature. 
Current work Includes analytic developments for measures of confidence 
of obtaining a global optimum solution^ refinements of the heuristic » 
and testing the general costs program. 

L. J. Whlte» M. E. Doherty 



BANDWIDTH REDUCTION OF SPARSE MATRICES BY ROW AND COLUMN PERMUTATIONS 

It has been shown that there Is an Important class of sparse 
matrices 9 called well r-bandable, for which bandwidth minimization 
can be effected simply and determlnlstlcally . This analysis has led 
to an algorithm 9 called SYMBAND, which determines row and column 
permutations to achieve a significant bandwidth reduction. The concept 
of stability of a bandwidth reduction algorithm has been Introduced. 
Extensive computational results have been obtained Involving random 
symmetric sparse matrices i and the algorithm SYMBAND has been shown 
to be stab let convergent^ and reasonably fast. 

D. S. Kerr, L. J. White, P. T. R. Wang 



BINARY STRINGS AND TOPOLOGY 

A continuing Interest In the problem of representing topological 
properties In binary language has led to a procedure for generating 
strings over (0,1) with topological interpretations. It appears 
applicable to topologica) spaces admitting both a measure and a metric. 
It is convenient to take both as bounded, with the strings associated 
with it Interpretable as binary decimals i.e.; real positive numbers 
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between 0 and 1. The teimlnatlng decimals represent a countable 
basis of neighborhoods, the Infinite declnals represent polntb of 
the space, and the chain of neighborhoods represented by arr|gglng 
all the n-dlglt strings In numerical order constitutes the n 
approximation to a space-filling curve. It Is possible to de|jLne 
a metric equivalent to that originally given for which this n 
approximation has length (1-2* ) . The space-filling curve then has 
unit length, can be used to "coordinatlze" the entire space 1-dlmen- 
slonally, and thereby to Induce a natural 1-dlmenslonal metric. 
Obviously there are neighboring points, according to the original 
metric, which are separated by finite distance according to the 1-D 
metric, but this does not detract from the usefulness of the coordl- 
natlzation to name the points of th.'i space. The non-l-D nature of 
the space is taken into account by specifying closure or convergence 
relations in the set of infinite binary strings. While every infinite 
binary sequence defines a unique point, and points correspond to 
unique sequences almost always, there is an everywhere dense cotmtable 
subset of points defined by more than one sequence, analogous to the 
1-D Identity in the limit of .01000... and .00111.... The number of 
sequences to be so identified depends on dimensionality. The results 
suggest a possible new approach to dimension theory as well as the 
possibility of existence of topological spaces with a generalized 
dimensionality associated with a set of integers rather than a single 
integer. The reader is invited to experiment wljh tje following 
highly illuminating case. The space is a 30 -60 -90 triangle in 
the Euclidean plane with coordlnatizatlon assigned by iterated dissec- 
tions using altitudes dropp.id from right angle to hypothenuse, the 
smaller triangle being 0, the larger 1. As in the Helne-Borel theorem, 
every infinite binary string defines a unique point. Verify the 
properties earlier discussed. 

J. Rothstein 



COMPARISON OF ROUTINES FOR SOLVING NONLINEAR SYSTEMS OF EQUATIONS 

The object of this research is to parameterize nonlinear systems 
of equations so that one can more effectively select which routine is 
best suited for solving a given nonlinear system. Parameters under 
consideration include the size of the system, the goodness of the 
initial approximation to the solution, the nonlinear Ity of the system, 
and the number and interplay of the relative extreme of the component 
functions of the system. Based on the above criteria, a model has 
been developed which will provide the basis for comparing production 
routines for solving nonlinear jys terns of equations and evaluating 
their effectiveness with respect to systems based oa the above 
parameters . 

D. S. Kerr, L. J. White, D. L. Kalmey 
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MINIHUH-REDUNDANCY VARIABLE LENGTH COOING 

In 19S2| Huffman provided an alegant algorithm for coding non«- 
equlprobable maaaages ualng alphabets of equal cost letters. There 
Is also a well-known efficient algorithm, called the extension technique , 
for co<^lng equlprobable messagea ualng binary alphabets of unequal 
cost letters. To date we are not aware of the exlatence of a simple 
algorithm for the general case of coding non->equlprobable messages 
tislng unequal coat coding letters. Nevertheless, the problem has been 
attacked by a number of people^ Including Karp who provided an Integer 
programming solution, and Vam, who obtained a selective enumeration 
technique. 

Our research Is aimed at unifying the simple techniques for 
equlprobable and equal cost cases of this general problem. As a 
first step we have obtained an algorithm for coding equlprobable 
messages using two or more unequal ^.ost coding letters. Unlike 
Vam*s algorithm, this algorithm Is a stral^tforward generalization 
of the extension technique. We have also obtained a few preliminary 
results that may lead to a better characterization of the problem, 
and ultimately a reasonably simple algorithm for the general case. 

L. J. White, V. Santhanam 



SIEVES AND CONTEXT SENSITIVE LANGUAGE 

It has been shewn that the well-known procedure known as the 
sieve of Eratosthenes, and a class of generalized sieves, can be 
Implemented on a linear bounded automaton* This leads to simple 
proofs that not only do the prime numbers, In any base representation, 
constitute a context sensitive language, but many other languages as 
well. Examples are the **lucky" numbers studied by Ulam and collaborators, 
the self-avoiding randoi walks on general lattices, and many others. A 
similar analysis proves context sensitivity for numbers satisfying 
arbitrary systems of linear recursion equations (e.g., the Fibonacci 
numbers). Though the procedures considered are certainly Included both 
in those it is Intuitive to regard as sieves and those which correspond 
to context sensitive languages, it Is not clear whether the two concepts 
are coextensive, one is a special case of the other, or whether they 
merely overlap, and If the latter, what kinds of non-CS recursive 
languages might be recognized by sieves. 

J. Roths teln 



SOME L^-NORM MAXMIN PROBLEMS: ALGORITHMS AND APPLICATIONS 

Two closely related maxmln problems are under Investigation, 
one Involving the Euclidean norm, and the other a rectilinear norm. In 
each case a point is to be located In a given convex polyhedron to 
maximize the minimum distance from a given set of points In the 
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polyhedron. The rectilinear problem is shown to be a linear program- 
ming problem with m x n constraints where m is the number of points 
and n is the dimensionality of the space. The Euclidean problem is 
shown to be a nonconvex programming problem, specifically one for 
which a finite candidate solution set can be generated. It has been 
demonstrated that the well known duality approaches cannot be used 
to obtain an efficient algorithm for the general n-dlmensional problem. 
However, an efficient algorithm is proposed for the Euclidean problem 
in the plane. The case where the convex polyhedron is also the 
convex hull of the given points is considered. The potential relevance 
of the problem to location theory and pattern classification is under 
study. An extension of the algorithms to the case of weighted norms 
is also under investigation. 

L. J. White, B. Dasarathy 



A STUDY OF THE OPTIMAL PLACEMENT OP KNOTS FOR CUBIC SPLINE APPROXIMATIONS 

If the number and position of knots are specified, an optimal 
approximation of a given function by a cubic spli.ue can be effected 
either in the sense of mean square error or the (hebychen uniform norm. 
This research has investigated the optimum position of a given number 
of knots, and the variation of the above error criteria with the 
number of knots. This research was facilitated by a computer prog-am 
developed by C. DeBoor and J. Rice, which ir^^atively obtains an 
approximation of the optimal knot locations in order to minimise 
the mean square error for a given function. One interesting result 
obtained is that for a large class of functions, as the number of 
knots is increased, there is a marked decrease in error as a critical 
number of knots is approached, rather than a gradual decrease as 
might be expected. This was true even for quadratic and cubic poly- 
nomial functions tested, where error rates were quite low, as would 
be expected for a cubic spline approximation. It was found that the 
extrema of the second derivative provide a good guide for the place- 
ment of knots, and the curvature of the second derivative gives an 
indication of how many knots should be utilized. This curvature 
analysis also indicates the critical number of knots for which a 
substantial drop in the error occurs, and beyond which negligible 
decrease in error is obtained. It is hoped these studies will yield 
simple intuitive guidelines for a good choice of knot locations for 
cubic spline approximations of a given function. 

L. J. White, S. Muftic, P. Wong 



TCWARD AN ARITHMETIC FOR CELLULAR i^-'TOMATA AND PARALLEL COMPUTATION 

Earlier work on patterns generf.ted by groupoid string operations 
led to the idea of trying to construct a number system without carrying, 
based on the cyclic group of order ? It has been shown that requiring 
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the number system to be a Gray Code (which la implied by the no-* carry 
property), with the daughter string representing a number double that 
represented by the original string (considered as a finite non'*sero 
portion of a doubly infinite string) determines the complete number 
system (up to a mirror reflection, i.e., a convention on direction of 
reading). Many interesting properties of this encoding of the integers 
have been discovered. All non«»integral numbers are represented by 
infinite strings, but only those corresponding to terminating binary 
decimals can yet be evaluated. Uniqueness has also been proved for 
the ternary system based on the cyclic group of order three. For 
higher bases it is not known whether uniqueness up to isomorphism is 
provable. 

J. Roths tein 



TURING UNIVERSALITY AND PARALLELISM OF GROUPOID STRING ALGORITHMS 

Earlier work on patterns and algorithms made use of strings over 
groupoid alphabets, where a ^'daughter string" is generated from a 
given string, letter by letter, using the groupoid algorithm on 
contiguous pairs of letters of the given string. Using a one-»dimensional 
cellular automaton permits parallel computation of a daughter string 
in a single step. Explicit constructions have been given to specify 
a groupoid algorithm equivalent to an arbitrary one-dimensional (1-D) 
cellular automaton (C.A.) and to specify a 1-D C.A. equivalent to an 
arbitrary Turing machine. Successive Turing machine configurations 
are represented by successive generations of daughter strings. This 
Turing universality of groupoid algorithms has been shown to be real-* 
izable with a simple modification of an infinite shift register, natnely, 
where a shifted copy of the entire string is combined with an unshifted 
version letter by letter (by the groupoid algorithm) in the register 
cells. This computation is completely parallel in nature and is 
accomplished in a single step. A one-^step parallel procedure for 
finding a "parent" string (or showing that none exists) has also been 
given, as has a "speed-up theorem". The latter constructs, from any 
groupoid, another groupoid in which the kth descendant of the string 
over the original groupoid is computed in parallel in one step. 

J. Roths tela 



36 



IX. SYSTEMS PROGRAMMING 



CONTEXT PROTECTION AND CONSISTENT CONTROL IN DATA SECURE SYSTEMS 

In data secure systensi the basic unit of information for 
protection is called a data unit, which may be a single data item 
or a collection of data items having the same properties with respect 
to access control* 

Context protection means that the same data unit may be protected 
differently in different contexts. For example, the same data field 
may be protected differently in two different records. The difference 
may be determined by the manner in which the fields and records are 
being accessed. 

Consistent coi ^rol is concerned with the problem that when new 
data units based on the old data units of the data baise are created 
by the users, these data units may be protected consistently in the 
sense that their access requirements m\ist be generated automatically 
and be conformed with the access requirements of the old data units. 

Both context protection and consistent control can be enforced 
by means of certain built<-in relations among the data units involved. 
These relations can reveal any violation of conto.xt protection and 
consistent control. A graph- theoretic approach is being used for 
this study. 

D. Hsiao, C. J. Nee, D» Wong (Sponsor: Office of Naval Research. 
N00014-67-A-0232-0022) 

A CROSS-ASSEMBLER FOR THE MICRODATA 1600/21 COMPUTER SYSTEM 

The objective of this project is to develop an assembler for 
the Microdata 1600/21 written in PL/I on the IBM 370/165. The 
assembler will accept an assembly program in card Images and produce 
a source program listing and a hexadecimal object program deck. 

A preprocesser will be Included that will support conditional 
assembly and a macro language processing. 

D. K. Hsiao, T. Rodeheffer (Sponsor: The Ohio State University. 
Instructional and Research Computer Center and Department of 
Computer and Information Science) 
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CRYPTOGRAPHIC TECHNIQUES AND DATA SECURITY IN DATA BASE SYSTEMS 

The importance of data security is well recognized in and out 
of computer science. Current techniques, such as "access control" 
are inadequate for protecting against all threats to privacy which 
exist in computer systems. As Peterson and Turn pointed out, the 
only safeguard against some of these threats is "cryptocraphic 
transformations". 

Cryptography has been used exteasively in communication systems 
but its implementation in computer systems require different consider- 
ations. The main ones are: the long "eye" of the data and the necessity 
to process the datp -fn the computer system. This leads to the idea 
of processable" ciphers which allow processing of the data in its 
ciphered form. The first goal of our research is to find such 
ciphers, which provide reasonable security on the one hand and 
processing capability on the other hand. 

Since cryptographic transformations affect mainly the physical 
representation of the data, the subject is strongly connected to the 
problem of organizing and describing data bases. The problems of 
implementing cryptography as part of current data definition languages 
will be shown. We will also show the ways to expand these DDL's to 
include cryptographic transformations. Our second goal of the research 
is to find how to apply cryptographic transformations effeciently and 
how they affect the organization and description of data bases. In 
looking at the problem of cryptography and data bases we found that 
several other problems, related to security, exist in current descrip- 
tion of data base systems. One of them is the centralization of all 
security information in only one part of the data base, which may 
cause a threat to its security. Another problem is that security 
checks in general and cryptographic transformations in particular 
cause a significant overhead and we are concerned about the efficiency 
of such measures. The third goal of our research is to build a general 
model of a data base system which incorporates security considerations 
into it and which can demonstrate the tradeoff between security 
requirements and processing efficiency. 

E. Gudes, H. S. Koch, F. Stahl 



A DATA BASE MANAGEMENT SYSTEM WITH GENERALIZED DATA STRUCTURE, ACCESS 
AND UPDATE 



In a contemporary data base management system (DBMS) only a 
restricted number of record organizations, file structures, access 
and update techniques are supported. Of these that are supported, 
each is considered as a special case, with completely different 
software routines being used. The purpose of this research is to 
design and implement an experimental data base management system 
for generalized record organization and file structure and for 
unified access and update. 
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With this DBMS we can then study the efficiency and effectiveness 
of data base management systems issues and prepare the ground work 
for the study of data security and integrity problems in data base 
management systems. 

H. S. Koch, B. J. Kammerer, R. Knablein, H. Sieberth (Sponsor: 
Office of Naval Research. N00014-6 7- A-02 32-0022) 



DATA SECURE SYSTEM ARCHITECTURE 

The overall goal of this research is the development of a computer 
(hardware) architecture which facilitates the implementation of data 
base management systems with logical access control capabilities. In 
the first phase of the work we investigated the implementation feasibility 
of different types of data base protection. A data base model consisting 
of objects, relations among objects and relations among these relations 
was used to characterize different protection strategies. TVo basic 
protection strategies evolved: concealment protection and information- 
theoretic protection. A concealment-oriented mechanism protects 
objects and relations by selectively preventing user access to certain 
objects. The restrictiveness of a protection mechanism may be charac- 
terized as the number on nonexplicitly protected items that must also 
be protected to insure that the specified security conditions are 
maintained. Various classes of concealment protection were examined 
with respect to their implementation complexity and degree of restric- 
tiveness . Concealment protection may be divided into two broad 
categories: static protection and dynamic protection. Dynamic protec- 
tion unlike static protection requires an access history. The various 
degrees of dynamic protection are characterized by the amount of 
history information that must be maintained and the wa-' derivable 
relations are used. 

The simplest form of dynamic protection prevents effective access 
to a derivable relation by preventing access to those objects which 
characterize the protected relation. A more complex form of dynamic 
protection prevents access to objects and relations which may be 
used to derive the protected relations. Information-theoretic 
protection is based on a model of the inference capabilities of the 
user. Such a protection mechanism dynamically determines the 
probability that relations are known to the user. This probability 
is determined by considering all queries asked by the user, all 
specified security conditions and all inferences assumed available 
to the user. Protection is provided by specifying a threshold 
probability on each relation which the system may not allow the user 
to cross. Information-theoretic protection is by far the most 
complex form of protection, but potentially provides the least 
restrictiveness. The next stage of the work will select those 
protection strategies that are mostfeasible and design a computer 
architecture which provides them the context of a data base management 
facility. 

D. K. Hsiao, R. Baum (Sponsor: Office of Naval Research N00014-6 7- 
A-0232-0022) 
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A DATA SECURE SYSTEMS MODEL 

A multi-level model Is developed to allow the different problems 
In data security to be considered at a level of abstraction appropriate 
to the specific Issue under study. The highest level Is conceptual. 
In it, "patterns of protection" (Intuitively, the ways In which the 
users may access the data) can^be defined In formal and unanblguous 
ways. The Intermediate level of the model Is structural. Here, the 
prlmatlves and structures needed to realize the protection patterns 
of the conceptual level are defined. The role of a protection 
system Is to prevent the completion of Illegal access attempts. 
This can be thought of as being done by the creation of security 
deadlocks between offending users and "pseudo-users". The common 
capability list and access list protection mechanisms ftay thus be 
viewed as simply different techniques to detect these security dead- 
locks. Another aspect of the structural model Is the definition of 
logical data base structures which are more amenable to the require- 
ments of data secure systems. On the lowest level of the model, a 
system to illustrate the utility and practicality of the other, 
higher levels is being created. 

D. K. Hsiao, E. J. McCauley (Sponsor: Office of Naval Research. 
NOOO lA-6 7- A-02 32-0022 ) 



DESIGN AND IMPLEMENTATION OF A DATA BASE PROTECTION MECHANISM 

The goal is to design and implement a protection mechanism for 
a data base system. In this system, protection violations can be 
classified into two types, intentional and nonintentional. An inten- 
tional violation is an explicit request by the user for data which 
are denied by the protection pattern. In this case the user is 
deadlocked by the mechanism. In a nonintentional violation the user 
is allowed to request data, some of which may be denied by the 
protection pattern. In this case the user is not deadlocked. He 
will recieve those data which are not denied. 

The design is aimed to show that the working of the mechanism 
in the case of Intentional violations can be characterized \.y the 
theory of deadlock and in the case of nonintentional violations by 
the practice in authority item and capability list systems. 

D. K. Hsiao, N. Kaffen (Sponsor: Office of Naval Research. NOOOIA- 
67-A-0232-0022) 



DESIGN AND IMPLEMENTATION OF A METHOD FOR CONTROLLING COMPUTER PROGRAM- 
POSTMORTEM DIAGNOSTIC OUTPUT 



Debugging, or the correcting of programming errors, is one of 
the major problems in implementing a large computer software system. 
This project is to design and implement a postmortem debugging aid 
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which is controlled by the program prior to failure and outputs the 
most relevant information upon termination. The central part of 
this proposed debugging aid is the program control block. This 
block will contain fields of information useful for debugging. The 
objective of this project is to design and implement a postmortem 
debugging aid controllable by the program being tented; and to show 
that the use of this software tool significantly increases the 
debuggability of common assembly language programs • 

R. F. Ma this 9 M. S. Friedman 



THE DEVELOPMENT OF GUIDELINES FOR DATA BASE REORGANIZATION 

The major objective is the development of general guidelines 
and procedures that will allow the need for data base reorganization 
to be recognized and that will assist in the reorganization process. 
Specifically^ the objectives are to: 

1. Examine, identify and classify the different types of data 
processing installations using Data Base Management Systems 
(DBMSs). 

2. Identify and classify system limitations and Data Base 
Administrator (DBA) and user requirements with respect to 
data base reorganization giving the DBA the framework and a 
set of guidelines with which to work. 

3. Identify the types of commands that need to be in the Data 
Definition Language (DDL) for data base reorganization and 
specify what needs to be added to the Data Manager to 
analyze and identify the need for data base reorganization. 

4. Identify and develop portions of the Data Manager that will 
give DBAs more flexibility with the DBMS they have chosen. 

U. S. Koch (Sponsor: The Ohio State University. Graduate School) 

FILE PARTITIONING AND RECORD PLACEMENT IN ATTRIBUTE-BASED FILE 
ORGANIZATIONS 

In this research a method is suggested which provides for improved 
accessing efficiencies by recognizing the desirability of logical file 
partitioning. With such a technique the number of different secondary 
storage cells (e.g., pages in virtual memory systems) accessed is mini- 
mlzed. File partitioning dictates \he record placement in the attribute 
based file organizations. It is the aim of this research to relate the 
results to the logical file partitioning problem in the CODASYL DDLC 
and to the enhancement of protection and access control in shared data 
base systems. 



D. K. Hsiao, F. Manola, E. J. McCauley (Sponsor: Naval Research 
Laboratory and Office of Naval Research N00014-67-A-0232-0022) 
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GRAPHICAL MODEL OF FUNCTIONAL DEPENDENCIES IN A RELATIONAL DATA BASE 

A relational model for data base management systems has been 
proposed to allow the user to express logical relationships among 
the data without having to be concerned with design and implementation 
details such as storage structures. In this model information is 
represented as sets of relations. Various normal forms of relations 
have been suggested in order to make the sets of relations easier 
to understand and control, simpler to operate upon and more informative 
to the casual user. When there are many relations in a data base, 
there are many possible choices of normalized relations. Functional 
dependencies among the various attributes affect the choice of rela- 
tions. A graphical model is being developed to express these depend- 
encies so that the best set of relations may be selected. 

D. S. Kerr (Sponsor: Office of Naval Research. N00014-67-A-0232-0O22) 



IMPROVED ERROR DIAGNOSTICS FOR IBM SYSTEM /370 OS/MVT ASSEMBLER 
PROGRAM DUMPS 

A major problem with assembler program dumps as they now exist 
is that they leave the inexperienced (and sometimes experienced) 
programmer in the dark as to the actual cause for the program's 
abnormal termination. In many cases the error cause is only loosely 
connected to the system message produced with the dump, and the 
programmer has no alternative but to turn to a consultant for debugging 
help. To remedy this situation, a postmortem debugging system has 
been developed to provide the programmer with improved error diagnostics 
and debugging hints upon abnormal termination of his program. Contents 
of relevant registers and storage locations are analyzed, along with 
the program's object code and system control blocks, to arrive at the 
most probable cause(s) of the exception. The programmer is not deluged 
with a great deal of what he considers to be useless output. The 
system currently handles program interruptions (types OCX), but will 
be expanded to cover many other error types. Additional diagnostic 
features, including selective facilities, are being developed. 

R. F. Mathis, B. M. Kirsch 



INVESTIGATION OF CRYPTOGRAPHIC TECHNIQUES FOR COMPUTER SECURITY 
AND PRIVACY 

The objectives of the research are to develop methods by which 
sensitive data can be made secure on computers in order to protect 
individual privacy. These include: 

1. Development of the necessary mathematical framework within 
which the analysis of cryptographic techniques for providing 
computational security can be performed. 
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2. Investigation of existing known cryptographic technliiues. 

3. Determination of the applicability of the known techniques 
to existing problems In computational security. 

A. Investigation of new techniques or extension of existing 
techniques to provide additional computational security. 

F. Stahl (Sponsor: The Ohio State University. Graduate School) 

INVESTIGATION OF PATTERN RECOGNITION TECHNIQUES AS APPLIED TO 
PROBLEMS OF COMPUTER SECURITY 

Cryptographic techniques have been proposed and Implemented 
as one means of protecting sensitive information In computer systems. 
Crypt analysis, the name for both the scientific study of the security 
level of proposed crypto systems, and, also, the art of breaking 
crypto systems, has been almost totally Ignored with regard to such 
systems during their design. 

An investigation of the techniques of pattern recognition that 
can be used to aid in cryptanalysis has been undertaken. Clearly, 
an encrypted message lays bare before the eyes of the **enesQr** all 
the Information that is to be concealed, less, of course, the logical 
key. All the original redundancy and patterns Invoked through the 
use of the language of the original message are present and merely 
need to be recognized by the clever analyst. 

F. Stahl (Sponsor: Air Force Office of Scientific Research 72-2351) 



MAPPING FUNCTIONS FOR INDEPENDENT DATA DEFINITION 

The schema-subschema concept has evolved to facilitate Independent 
data definition. The subschema concept has not been present in data 
base managem^t systems because it is not understood how to Interface 
the schema with a subschema. 

The purpose of this research is to study the effect of where 
the mapping functions are placed, what mapping functions are needed 
and whether subschemas can be derived from other subschemas. 

The goal of this research is not to standardize subschemas, but 
to discover good design principles for the Data Manager and new 
mapping algorithms for certain classes of data. 

H. S. Koch 



A MODEL FOR ACCESS AND ACCESS CONTROL IN DATA BASE MANAGEMENT SYSTEMS 

The goal of the research is to propose a model in which funda- 
mental issues in the use of data structures, in the development of 
data access strategies, and in the employment of both data structures 
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and access strategies for access control purposes may be studledt 
The model Is a generalization of previous models by the Investigators 
as well as the work by others on similar models. It Is hoped that 
by studying the model, data base system problems and Issues may be 
better understood. Furthermore, systems which make use of data models 
with complicated structure (e.g., network-structured data based on the 
CODASYL specifications) and systems which use a content^based approach 
with simple structures (e.g., relational models) may be compared 
and contrasted In the generalized model. 

Mr. Manola, a Visiting Research Associate, Is taking a leave- 
of-absence from the Naval Research Laboratory. 

D. K. Hsiao, F. Manola (Sponsor: Naval Research Laboratory) 



PAGINATION OF PROGRAMS FOR VIRTUAL MEMORY SYSTEMS 

Pagination for virtual memory systems of modular programs with 
certain restrictions on their control flow properties has been 
Investigated from two points of view: 1) If the module size Is 
greater than the page size, each module Is broken up Into basic 
blocks which are then allocated to pages; and 2) If the module 
size Is less than the page size, entire modules are allocated to 
pages* 

Algorithms based on complle-tlme loop analysis have been developed, 
for the Instruction portion of such programs, which minimize the page 
fault rate In each of these cas^s; for a given page size, the effec- 
tiveness of the algorithms is a decreasing function of memory size 
and, for a given memory size, the effectiveness of the algorithms Is 
an Increasing function of page size. 

A trace-driven simulation using the XPL compiler-compiler shows 
that application of these algorithms has the expected effect on the 
page fault rate. 

C. R. Foulk, D. n. Y. Su 



PARALLELISM IN COMPUTER PROGRAMS 
. 

In 1966 Hellerman proposed a computer design with several arithmetic 
units Instructed from a single sequential program to perform several 
computations in parallel. One of the early results of the present 
research has been a technique for the minimization of resources required 
for minimum time evaluation of algebraic expressions on this Hellerman 
computer. The generalization of this result to branch-free programs 
depends on a result of Aho, Sethi, and Ullman, that storage conflicts 
resulting from multiple uses of a variable name can be avoided by 
systematic renaming of variables. Identification of independent uses 
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of variable names has been shown feasible in smooth programs, that is, 
programs composed of basic blocks combined into two-terminal networks 
by the operations of concatenation, alternation, and iteration. In 
a smooth program it is possible to find for each computation a set of 
appropriate basic blocks. TWo computations either share such a set 
or have disjoint sets. The technique for resource irinimisation of 
branch-free programs applies to each set of appropriate basic blocks, 
and thus gensraliees to smooth programs. Algorithms for the automatic 
application of the technique are being devised. 

C. R. Foulk, 0. C. Juelich 



THE PL/X SYSTEMS PROGRAMMING LANGUAGE 

TVie PL/X language is a high-level systems programming language 
for the DEC- 10 computer. Its syntax is, to a large degree, that of 
PL/I. It also possesses some of the features available in IBM's 
systems programming language, PL/S (among them, the DO-CASE Construct). 
Being specifically designed for the DEC configuration, the PL/X 
language also allows specifications for octal constants (as opposed 
to binary bit strings) and provides in-line facility for MACRO- 10 
code (dec's assembler language). The PL/X compiler runs on the IBM 
370 computer. The compiled object programs are physically transported 
to the DEC-10 system for execution. Tlte run-time monitor PLXMON, 
which supervises the execution, has recently been rewritten and upgraded. 
Consequently, the manual entitled The PL/X Compiler - Subsystem W riters' 
Manual is being replaced with an updated version. 

Plans are made to upgrade the PL/X compiler and the language 
reference manual. The incorporation of some diagnostic facility 
into the compiler will also be included. 

D. K. Hsiao, B, J. Kammerer, W. Young (Sponsor: The Ohio State 
University. Instructional and Research Computer Center and Department 
of Computer and Information Science) 



PROVING PROGRAM CORRECTNESS ON COMPLEX FILE STRUCTURFS 

By using the inductive assertion approach to prove program 
correctness, this research will expand upon the work of Hoare, 
Bus tall, and Good, who proposed the assertion approach originally. 
By expanding the work of Laventhal to a complex, but well definea 
file structure proposed by Hsiao and Harary, proof of programs 
operating on this file structure can be inductively determined. 

Primitive concepts and other abstractions will be related to 
the structure so that inductive assertions can be made that will 
prove the correctness of the program by which the file structure 
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Is manipulated. In doing so It Is believed that the procedure for 
verification of data base management systems programs may one day 
be automated. It Is also believed that by analyzing the assertion 
to actual program relationship, concise and structured code can be 
generated and that this code will be error-free. 

D. K. Hsiao, B. Horger (Sponsor: Office of Naval Research N00014- 
6 7-A-02 32-0022) 



A SMOOTHNESS INDEX FOR COMPUTER PROGRAM FLOWGRAFHS 

The ultimate objective of thld research Is to define an Index 
of smoothness for computer flowgraphs and then to determine whether 
or not there Is any correlation between this Index and other proper- 
ties of such flowgraphs. 

A graph graiiinar, SG, Is defined. The productions of SG corre- 
spond to the basic and familiar operations oft 

concatenation " normal sequencing 
alternation > If- then-else 
Iteration " looping 

The "language" produced by this grammar Is the set of all flowgraphs 
which can be generated by these three operations only. Such a flow- 
graph Is called smooth . • 

A non-smooth flowgraph contains at least one sub-flowgraph which 
Is not generable by SG. In this case, It Is possible to transform 
the original flowgraph Into a smooth one. 

Transformation of a non-smooth flowgraph to a smooth one always 
Increases the cardinality of the arcs of the flowgraph. By computing 
the ratio of the cardinality of the arcs before the tt-, ans formation 
to the cardinality of the arcs after the transformation, an Index of 
smoothness S (0 < S <. 1) caii be obtained. 

A computer program which obtains the Index of smothness of any 
given flowgraph has now been written. This program will be used to 
obtain the Index of smoothness of randomly selected flowgraphs from 
algorithms published In Communications of the ACM. It will then be 
determined whether or not there Is any correlation between the Index 
of smothness of these flowgraphs and their other properties. 

C. R. Foulk, 0. Longe 



TEACHING DEBUGGING 

This project Is concerned with the integration of debugging 
concepts Into programming courses. What effect does programming 
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style or structured programming have on debuggablllty? What information 
Is necessary to debug a program? What examples best Illustrate debugging 
concepts and procedures? One seminar course was held this year and 
two more are planned for next year to study this problem* A course 
outline and set of examples are being developed for beginning programming 
courses* 

R. F. Mathis, C. Hasbrouck 



USE OF AN ERROR ANALYSIS CONTROL BLOCK (EACB) TO CONTROL PROMPTING 
IN A POSTMORTEM DEBUGGING SYSTEM 

The most common postmortem debugging aid Is the system supplied 
core dump* To make the Information In the dump more useful to the 
programmer/debugger, a dump analysis prompting system Is being 
developed. In this system the core contents are put on disk storage 
rather than printed. The programmer /debugger Is able to Investigate « 
this version of a dump through a time sharing terminal. While placing 
control blocks and core areas on disky an error analysis control block 
(EACB) is created which contains relevant flags and pointers from 
other control blocks and which will be used to control prompting when 
the dump is investigated. Prompting messages Include as many error 
analysis and correcting suggestions as possible. As ne^^ error causes 
and error analysis methods are discovered they can be seml-automatically 
added to the system. 

R. F. Mathis, R. J. Atwell, Jr. 
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X. COMPUTER NETWORKS 



DISTRIBUTED OPERATING SYSTEMS FOR COMPUTFR NETWORKS 

This research Is concerned with distributed network opei.^itlng 
systeob. In the sarje way microcode Is used to define the basic 
machine behavior, the operating system Is tised to lAplenent a meta- 
machlne. The function of an operating system is then to unify, 
enrich and control the hardware. In the context of network, an 
operating system Is used In the same manner. The network operating 
system Is xised to minimize the user-vlslble differences between host 
systems. A conventional operating system controlling a unl-processor 
or a localized multi-processor Is modular, layered, with no redundancy. 
This type of structure offers very little possibilities in the 
construction of a network operating system. Standard systems have 
been designed on the assumption that every module is directly acces- 
sible without delay. However the concept of distributed operating 
system implies that modules may be geographically far apart and thus 
not inmediat-ely available to the controller. In this study a new 
approach is considered for the design of operating systems. Also, 
in a conventional operating system, there is no redundancy whereas, 
in the proposed system, redundancy is a key factor for permanency. 
This feature implies that a major decision of the concept of operating 
system has to be undertaken to enable multiple copies o£ the same 
module to interact in order to achieve a common goal. 

M. T. Liu, S. Foumler 



EMULATION OF COMPUTER NETWORKS BY MICROPROGRAMMABLE MICROCOMPUTERS 

The advent of low-cost, sophisticated, mlcroprogrammable, LSI 
microprocessors has renewed Interest in multiple-computer systems. 
This research suggests a method of implementing microprammable micro- 
computer systems as a sophisticated tool (emulators) for decreaslne 
the economical risk Involved in development of a large computer net- 
work. Two levels of emulation are proposed for different networking 
configurations. The Intel 8080 microprocessors are selected for 
implementing the systems described. As a special case where mutually 
exclusive data are processed, the, proposed systems can be changed 
to parallel processing systems for parallel execution of computer 
programs at different levels. 

M. T. Liu D. Cohen (Sponsor: National Science Foundation. GN 53A.1 
and GN 27A58) 
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LOCAL COMPUTER NETWORK 

The objective of this project is to implement a local computer 
network interconnecting IBM 370/165, DECsystem-lO and Micro 1600/21 
computer systems. The interconnected systems can serve as a basis 
for the study of computer network and facilitate better utilization 
of all the computers involved. 

A configuration with low cost and adequate performance has been 
proposed. This configuration calls for an utilization of the Micro 
1600 as a communication processor, a synchronous communication inter- 
face between the Micro 1600 and IBM 370/165 and an asynchronous 
communication interface be twee.; the Micro 1600 an I DEC System- 10. 
The particular interconnection not only will impact minimally the 
existing operating systems of the computers involved but also will 
rely exclusively off-the-shelves hardware components. A cost esti- 
mation of the needed hardware components and software development is 
also included in the proposal. 

A design of. the software network system is underway. 

D. K. Hsiao, B. Farah, T. Rodeheffer (Sponsor: The Ohio State University. 
Instructional and Research Computer Center and Department of Computer 
and Information Science) 

MIXED DATA/VOICE TRANSMISSION SYSTEM FOR COMPUTER COMMUNICATIONS 

This research is concerned with the development of a feasible 
and inexpensive method for computer communications by embedding digital 
data signals into a collective set of analog voice channels during 
the inactive periods of voice conversation. There are two ways of 
doing this: 

1. interleaving or embedding the digital signal into the voice 
circuit during the silent period of the voice conversation; or 

2. transmitting concurrently or superimposing the digital signal 
onto the voice signa.l. 

Minicomputer /microcomputer technology is used in the control of the 
digital signal embedding procedure. The goals of the research are: 

1. to lower the link cost for computer communications; 

2. to improve the usage efficiency of bandwidth in the existing 
voice communication network. 

M. T. Liu, J. T. Wang 
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OPTIMAL PROCESS ALLOCATION IN DISTRIBUTED H^^.TEROGENEOUS COMPUTER 
NETWORKS 

This research is concerned with optimal process allocation in 
distributed heterogeneous computer networks so as to minimise hardware/ 
software redundancy. A mathematical model is proposed for analysis 
using integer and dynamic pro^jramming which allows different service 
rates for each process. A simulation model is to be implemented to 
verify the results. It is exp^ected that the goal of minimising 
redundancy can not only increase overall system efficiency, but 
can also self-tune the system according to the constantly changing 
request pattern. 

M. T. Liu, T. T. Cheng 



VARIABLE-LENGTH MESSAGE TRANSMISSION FOR DISTRIBUTED LOOP COMPUTER 
NETWORKS 

This research is concerned with the development of new protocols 
for variable- length message transmission in distributed loop computer 
networks. An improved variable-length message transmission scheme is 
proposed, and a conceptual model is developed of its operation. The 
model illustrates a method by which the nodal interface can delay 
incoming messages by haruware buffering just long enough for an out- 
going message to be placed on the loop; advantage is taken of gaps 
betwi^en messages to clear out the delay buffer and to make room for 
future outgoing messages. Also, some possible hardware realizations 
of the model are developed. The new scheme has the effects of: 

1. reducing the time a message must wait queued at the node 
before being transmitted, 

2. increasing the utilization of the loop, and 

3. reducing variations in the instantaneous load factor of the 
loop. 

It is expected that the proposed scheme will yield a transmission 
technique superior to fixed- length framing currently used by most loop 
computer networks. 

M. T. Liu, C. C. Reames (Sponsor: National Science Foundation. GN 534^1 
and GN 27458) 
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. JOINT PROGRAMS 

INVESTIGATION OF COMPUTER SCIENCE EDUCATION IN LATIN AMERICA 

An investigation has been initiated into the present level of 
Computer Science Education in Latin America. Of particular interest 
is the scope of the programs* at Latin American Universities and the 
extent to which these universities are meeting the demand for computer 
specialists as a result of the rapid increase in the number of computers 
being utilized in their respective countries. Data will be collected 
through use of questionnaires and a field trip. Analysis will be 
performed upon this information and recommendations will be made for 
future development. The results of this study will be published. 

Questions asked are concerned with the present level of programs 
In the computer area (including ntunbers of students, level of studies , 
placement of graduates, language requirements, hardware, and courses 
offered), level of faculty (including level of education, country of 
education, and research interest), future plans for expansion of 
programs, and sources of support (both financial and technical). 

A field trip to South America will be undertaken during the latter 
part of this year. In addition to gathering information abouc the 
educational programs the principle investigator will advise and assist 
in the development of graduate studies and research in the Computer 
Science Department of Unlversidad Simon Bolivar, Caracas, Venezuela. 

F. Stahl, A. Pumarada (Sponsored in part: UNESCO. 610.247) 
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APPENDIX A 

COMPUTER AND INFOBMATION SCIENCE COURSE LISTING 
BY NUMBER AND TITLE 

100 Computers in Society 

201 Elementary Digital Computer Prograninlng 

211 Computer Data Processing I 

212 Computer Data Processing II 

221 Programming and Algorithms I 

222 Programming and Algorithms II 
294 Group Studies 

311 Introduction to File Design and Analysis 

411 Design of On-Line Systems 

422 Topics in Computing for Engineers 

494 Group Studies 

505 Fundamental Concepts of Computer and Information Science 

509 Survey of Computer and Information Science for High School 
Teachers 

541 Survey of Numerical Methods 

542 Introduction to Computing in the Humanities 

543 Intermediate Digital Computer Programming 

548 Digital Computer Programming for High School Teachers 

549 Numerical Analysis for High School Teachers 

550 Introduction to Information Storage and Retrieval 
555 Survey of Programming Lai ,aage8 

594 Group Studies 

610 Principles of Man-Machine Interaction 
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640 Numerical Analysis 

641 Computer Systems Programming I 

642 Numerical Linear Algebra 

643 Linear Optimization Techniques In Information Processing 

644 Advanced Computer Programming 
652 Modeling of Information Systems 

675 Digital Computer Organization 

676 Minicomputer Evaluation and Selection 

677 Computer Networks 
680 Data Structures 

693 Individual Studies 

694 Group Studies 

705 Mathematical Foundations of Computer and Information Science 

706 Information Theory in Behavioral Science 
712 Man-Machine Interface 

720 Introduction to Linguistic Analysis 

726 Theory of Finite Automata 

727 Turing Machines and Coin)utablllty 

728 Topics in Theory of Computing 

730 Basic Concepts in Artificial Intelligence 
735 Statistical Methods in Pattern Recognition 

740 Computer Systems Programming II 

741 Comparative Operating Systems 

745 Numerical Solution of Ordinary Differential Equations 

746 Advanced Numerical Analysis 

750 Modem Methods of Information Storage and Retrieval 
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751 Fundamentals of Document-Handling Information Systems 

752 Techniques for Simulation of Information Systems 

753 Theory of Indexing 

754 Language Processing for Information Storage and Retrieval 

755 Programming Languages 

756 Compiler Design and Implementation 

760 Selected Topics in the Mathematics of Information Handling 

765 Theory of Management Information Systems 

775 Advanced Computer Organization 

780 File Structures 

781 Aspects of Computer Graphics Systems 

788 Intermediate Studies in Coiq)uter and Information Science 

793 Individual Studies 

794 Group Studies 

797 Interdepartmental Seminar 

805 Information Theory in Physical Science 

806 Cellular Automata and Models of Cooq>lex Systems 
812 Computer and Information Science Research Methods 
820 Computational Linguistics 

835 Special Topics in Pattern Recognition 

840 Operating System Implementation 

845 Numerical Solution of Partial Differential Equations 

850 Theory of Information Retrieval I 

851 Theory of Information Retrieval II 

852 Design and Analysis of Information Systems Simulation 
855 Formal Theory of Programming Languages 
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865 Seminar on Sodo-Psychologlcal Aspects of the Information Sciences 

880 Advanced Theory of Computablllty 

888 Advanced Studies In Computer and Information Science 

889 Advanced Seminar In Computer and Information Science 
899 Interdepartmental Seminar 

994 Group Studies 

999 Research 
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APPENDIX B 



COMPUTER AND INFORMATION SCIENCE FACULTY 



Marshall C. Yovits, Ph.D., (Yale University). 

Professor and Chairman of Department of Computer and Information 
Science and Professor of Electrical Engineering. Director, C.I.S. 
Research Center. Information systems, theory of the flw of infor- 
mation, self-organiztng systems. 

Ranko Bojanic, Ph.D., (Mathematical Institute of the Serbian Academy 
of Science). 

Professor of Computer and Information Science and Professor of 
Mathematics. M***-"^ - 7^tical analysis, theory of approximation. 

Richard I. Hang, M.S., (The Ohio State University). 

Professor of Computer and Information Science and Professor of 
Engineering Graphics. Computer graphics, engineering application 
of computers. 

Clyde H. Kearr.s, M.S., (Tlie Ohio State University). 

Professor of Computer and Information Science and Professor of 
Engineering Graphics. Computer graphics, engineering application 
of computers. 

Robert B. McGhee, Ph.D., (University of Southern California). 

Professor of Computer and Information Science and Professor of 
Electrical Engineering. Control theory, switching theory, logical 
design. 

Harold B. Pepinsky, Ph.D., (University of Minnesota). 

Professor of Computer and Information Science and Professor of 
Psychology. Clinical and socio-^cultural psychology. 

Roy F. Reeves, Ph.D., (Iowa State University). 

Professor of Computer and Information Science and Professor of 
Mathematics. Director, Instruction and Research Conqputer Center. 
Numerical analysis and programming. 

Jerome Rothstein, A.M., (Columbia University) . 

Professor of Computer and Information Science and Professor of 
Biophysics. Informational problems in science, methodology, 
biocybemetics. 

Charles Saltzer, Ph.D., (Brown Univeristy). 

Professor of Computer and Information Science and Professor 
of Mathematics. Coding theory, numerical analysis, automata 
theory . 
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Hugh C. Atkinson^ M.A.> (University of Chicago). 

Associate Professor of Library Administration and Director of 
Libraries. Library on-*llne automatloni data processing for 
libraries^ technical service processing models. 

Kenneth J. Breeding^ Ph.D.» (University of Illinois). 

Associate Professor of Coiq>uter and Information Science and 
Associate Professor of Electrical Engineering. Computer organization 
and Switching Theory. 

Balakrishnan Chandrasekarany Ph.D.^ (University of Peunsylvanla) . 

Associate Professor of Computer and Information Science. Pattern 
recognition and artificial intelligence » learning automata 
theory^ finite memory decision theory^ and game theory. 

Ronald L. Ernst^ Ph.D.^ (University of Wisconsin). 

Associate Professor of Computer and Information Science and 
Associate Professor of Psychology. Htunan performance theory 
and engineering^ complex information processing and systems 
evaluation. 

Clinton R. Foulk^ Ph.D., (University of Illinois). 

Associate Professor of Computer and Information Science. 
Programming languages, systems programming, programming heuristics. 

David K. Hsiao, Ph.D., (University of Pennsylvania). 

Associate Professor of Computer and Information Science. Systems 
programming, information storage and retrieval systems, file 
systems, data base management systems, access control and privacy 
protection of data, data definition language and processor, 
system architectures. 

Douglas S. Kerr, Ph.D., (Purdue University). 

Associate Professor of Computer and Information Science. Numerical 
analysis and programming. 

Gerald J. Lazorlck, Ph.D., (State University of New York at Buffalo). 
Associate Proiessor of Computer and Information Science and 
Associate Professor of Library Administration. Director, 
Mechanized Information Center. Information storage and retrieval, 
library systems: design and analysis. 

Ming-Tsan Liu, Ph.D., (University of Pennsylvania). 

Associate Professor of Computer and Information Science. Computer 
organization, switching and automata theory, mathematical program- 
ming, computer architecture, pseudo-Boolean programming, threshold 
logic. 

Anthony E. Petrarca, Ph.D., (University of New Hampshire). 

Associate Professor of Computer and Information Science. Automatic 
indexing, chemical structural Information processing, automated 
search systems, other aspects of information storage and retrieval. 
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James B. Randels, Ph.D., (The Ohio State University). 

Associate Professor of Computer and Information Science and 
Assistant Director, Learning Resources Computer Center. Computer 
operating systems and utilities, telecommunications applications, 
subroutine libraries, programming languages. 

James E. Rush, Ph.D., (University of Missouri). 

Adjunct Associate Professor of Computer and Information Science. 
Indexing theory, automated language processing, organization of 
information, and parallel processing. 

Celianna I. Taylor, B.S.L.S., (Graduate School of Library Science, 
Case-Western Reserve University). 

Senior Research Associate and Associate Professor of Library 
Administration. Information dissemination wystems, information 
centers, library systems and management. 

Lee J. White, Ph.D., (University of Michigan). 

Associate Professor of Computer and Information Science and 
Associate Professor of Electrical Engineering. Mathematical 
programming, data structures, organization of information. 

Ronald L. Wigington, Ph.D., (University of Kansas). 

Adjunct Associate Professor of Computer and Information Science 
and Director of R. & D. , Chemical Abstracts Service. Computer 
system design. 

Alan W. Biermann, Ph.D., (University of California, Berkeley). 

Assistant Professor of Computer and Information Science. Theory 
of computer systems, formal systems. 

H. William Buttelmanu, Ph.D., (University of North Carolina). 

Assistant Professor of Computer and Information Science. Automata 
theory, computer architecture and programming languages. 

Thomas G. DeLutis, Ph.D., (Purdue University). 

Assistant Professor of Computer and Information Science. Design 
and evaluation of information systems, systems programming. 

Harvey S. Koch, Ph.D., (Pennsylvania State University). 

Assistant Professor of Computer and Information Science. Data 
definition language, data base management, programming languages 
and compiler design. 

Frederick S. Koehl, Ph.D., (The Ohio State University). 

Adjunct Assistant Professor of Computer and Information Science 
and Math Analyst, Instruction and Research Computer Center. 
Sorting techniques, topological groups, compiler design. 

Anthony P. Lucido, Ph.D., (Iowa State University). Appointed Autuim 197A. 
Assistant Professor of Computer and Information Science. Computer 
architecture, compiler design, interactive computer graphics. 
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Robert F. Mathis, Ph.D., (The Ohio State University). 

Assistant Professor of Computer and Information Science and 
Assistant Dean and Secretary of the Graduate School. Programming 
languages I ntimerlcal analysis. 

Daniel J. Moore, Ph.D., (University of Kansas). Appointed Autumn 1974. 
Assistant Professor of Computer and Information Science. Complexity 
theory, recUrsion theory, artificial intelligence, 

Norman K. Sondheimer, Ph.D., (University of Wisconsin). Api;ointed Autumn 1974. 
Assistant Professor of Computer and Information Science. Natural 
language processing within artificial intelligence, information 
storage and retrieval, systems programming. 

Frederick A. Stahl, Ph.D., (University of Illinois). 

Assistant Professor of Computer and Information Science. 
Computational security, cryptography, information retrieval, 
computers in the humanities, and in the law, artificial intelligence. 

Stuart H. Zweben, Ph.D., (Purdue University). Appointed Autumn 1974. 

Assistant Professor of Computer and Information Science. Programming 
languages, compilers, data structures, operating systems. 

Donald L. Kalmey, M»S., (The Ohio State University). 
Counselor. 

Ernest Staveley, B.S., (U.S. Naval Postgraduate School). 

Administrative Assistant and Assistant Director, C.I.S. Research 
Center. 
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APPENDIX C 

COMPUTER AND INPOKMATION SCIENCE SEMINAR SERIES 



August 9, 1973 "The Use of Probabilistic Automata as Models of Some 
Aspects of Human Memory," Stephen Hepler, Ph.D. Candidate, 
The Ohio State University. 

August 23, 1973 "Human Learning of Cyclic Finite State Languages," 
James Beug, Ph. D. Candidate, The Ohio State University. 

October 4, 1973 "Computer Enhancement of Electron Micrographs," 
Bum L. Lewis, University of Calliomla, Berkeley. 

October 18, 1973 "Computer Facilities at Ohio State," Weldon E. 
Ihrlg, Chairman, Computer Center Coordinating Commltteei Roy 

F. Reeves, Director, Instruction and Research Computer Center; 
Robert F. Mathls, Chairman, Computer and Information Science 
Computer Committee, The Ohio :>tate University. 

October 25, 1973 "Programming at i - The Programming Process from 
Product Design Through Produv.. upport," Larry Foster, Programming 
Center Manager, IBM Systems Development Division, Galthersburg, 
Maryland. 

November 1, 1973 "Computers and Medieval Data Processing," Thomas 

G. Whitney, Instructional tnd Research Computer Center, The 
Ohio State University. 

November 6, 1973 "Up In the Air about Data Base Management?" Captain 
Thomas Carroll, Chief of Information Systems Section, Wright- 
Patterson Air Force Base. 

November 8, 1973 "Computer Technology and the Problem of Privacy," 
Philip M. Burgess, HEW Committee on Privacy and Automated 
Personal Data Systems, Mershon Professor of Policy Science, 
Professor of Political Science, The Ohio State University. 

November 15, 1973 "Time Flow Mechanisms for Digital Logic Simulation," 
Cliff W. Hemming, Jr., Bell Telephone Laboratories, Coluiid)us, 
Ohio. 

November 20, 1973 "Formal Studies in Music Composition," Otto E. 
Laske, Studio for Electronic Music, University of Utrecht, 
Utrecht, Netherlands. 

November 29, 1973 "Programming and Debugging: There Must be a Better 
Way," Alan W. Blermann, Assistant Professor, The Ohio State 
University. 

January 10, 1974 "Problems In Interactive Bibliographic Files," 
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Frederick G. Kllgour, Executive Director, The Ohio College Library 
Center, Columbus, Ohio. 

January 17, 1974 "Software Physics," Maurice H. Halstead, Professor 
of Computer Sciences, Purdue University. 

January 24, 1974 "Symmetric Ternary Radix for Computer Arithmetic," 
Paul S. Y. Chan, Ph.D. Graduate, The Ohio State University. 

January 31, 1974 "The Interactive Generation of Facial Images on a 
CRT Using a Heuristic Strategy," Mark L. Glllenson, Ph.D. 
Candidate, Department of Computer and Information Science; and 
Graduate Research Associate, Computer Graphics Research Group, 
The Ohio State University. 

February 7, 1974 "Development of a Tlroe-Sharlng Utility," Dr. John 
R. Goltz, Senior Vice President, Compu-Serv Network, Columbus, 
Ohio. 

February 19, 1974 "The Application of Some Graph Theoretical Methods 
of Semantics Information Processing," Stephen L. Taylor, Computer 
Science Department, Northwestern University. 

February 21, 1974 "The Internal Structure of Algorithms," Stuart 
H. Zweben, Ph.D. Candidate, Computer Sciences Department, 
Purdue University. 

February 26, 1974 "Extraction of Speech Fomants Using a Micro- 
programmed Parallel-Processor System," Michael J. Knudsen, 
Department of Computer Science, Camelgle-Mellon University. 

February 27, 1974 "The Computational Complexity of Oracular Enumeration 
Procedures," Daniel J. Moore, Computer Science Department, 
University of Kansas. 

February 28, 1974 "Some Problems of Spatial Reference for Language 
Understanding Systems," Norman Sondhelmer, Ph.D. Candidate, 
Computer Sciences Department, University of Wisconsin. 

March 5, 1974 "Design and Implementation of a Data Conversion Language 
for Networks," Michael Schneider, Computer Science Department, 
University of Wlsconson. 

March 7, 1974 "Load Sharing of Batch Jobs: A Queueing Model : a 
Computer Network," Carl Landwehr, Computing Center, University 
of Michigan. 

March 12, 1974 "A Model for the Evaluation and Optimization of 
File Designs," Bing Yao, Computer and Communication Sciences 
Department, University of Michigan. 
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March 12, 1974 "A Model for the Evaluation and Optimization of 
File Designs/' Bing Yao, Computer and Communication Sciences 
Department, University of Michigan. 

March 14, 1974 "The Granniatical Inference of Formal Languages," 
Jan Beaujont North Carolina State University at Raleigh. 

March 26, 1974 "A Homogeneous Network for Data Sharing," R. W. 

Peebles, Computer Communications Network Group, University of 
Waterloo. 

March 27, 1974 "On the META-T Compiler-Compiler," Anthony Lucido, 
Iowa State University. 

March 28, 1974 "A Graphics Extension to SN0B0L4 for Recognition and 
Manipulation of Line Drawings," Linda Shapiro, University of 
Iowa. 

April 2, 19/4 "Computer Display of Curved Surfaces," Edwin Catmull, 
University of Utah, Salt Lake City, Utah. 

April 4, 1974 "Computing Reliability of Complex Systems," Aron 
Rosenthal, Department of Electrical Engineering and Computer 
Science, University of California at Berkeley. 

April 8, 1974 "Biographical Information," William E. Riddle, Assistant 
Professor, Computer and Commurdcation Sciences Department, 
University of Michigan. 

April 22, 1974 "The Design of an Integrated Medical Record Information 
System," Kevin O'Kane, Assistant Professor of Computer Science 
and Assistant Professor of Obstetrics and Gynecology, Pennsylvania 
State University. 

April 23, 1974 "Some Results on Computability of Phrase Structure 
Languages," H. William Buttelmann, Assistant Professor of 
Computer and Information Science, The Ohio State University. 

May 2, 1974 "Protecting Ones Proprietary Interest in Software - 
Which Way to Turn?" James A. Sprowl, Lecturer, School of Law, 
Northwestern University, Partner, Lawfirm of Mason, Kolehmalnen, 
Rathbum and Wyss. 

May 23, 1974 "Fourth Generation Computer Architecture," Harold S. 
Schwenk, Honeywell Information Systems. 
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APPENDIX D 
REUTKD ACTIVITIES OF THE STAPF OF 
COMPUTER AND INFORMATION SCIENCE RESEARCH CENTER 



J. 6. Abllock presented "Comparison of Relational Data Management 
Systems" (Co-author: V.K.M. Whitney), and "A Quantitative 
Measure for the Information Contained in a Data Set or a Message" 
(Co-author: M.C. Yovits) at the Computer Science Conference, 
Detroit, Michigan, February 11-14, 1974. 

J. A. Altken presented the paper "Performance Ev£. >:ation of Information 
Systems: Data Base Qiaracterization Facilities" at the Computer 
Science Conference, Detroit, Michigan, February 11-14, 1974. 

H. C. Atkinson was a consul uunt for Library Automation at Penn State 
University, College ParK, Pennsylvania, July 19-21, 1973. 

H. C. Atkinson presented "Will Computers Depersonalize Library Service?" 
State University of New York, Albany, Septesiber 21, 1973. 

H. C. Atkinson served as Panel Chairman of "Data Banks and Librnries" 
section of EDUCOM Fall 1973 Council Meeting and Conference on 
"Facts and Futures: What's Happening Now in Computing for Higher 
Education" at the Educational Testing Service, Princeton, New 
Jersey, October 9-11, 1973. 

H. C. Atkinson conducted a seminar for library staff at the University 
of Tennessee, Knoxville, Tennessee, May 15, 1974. 

R. J. Atwell, Jr. presented the paper "Use of an Error Analysis 
Control Block (EACB) to Control Prcrapting in a Post Mortem 
Debugging System" at the Computer Science Conference, Detroit, 
Michigan, February 11-14, 1974. 

A. W. Biermann; F. Petry attended the Third International Joint 
Conference on Artificial Intelligence, Stanford University, 
Stanford, CalifomJa, August 20-23, 1973. 

A. W. Biermann; R. Baum; R. Krishnaswamy; F. Petry made a 10 minute 
color movie entitled Autoprogrammer , March 1974 . 

A. W. Biermann gave seminar talks entitled "Constructing Programs 
from Example Computations" and showed the mo^rie Autoprogrammer 
at Duke University and the University of Texas. 

H. W. Buttelraann presented "Servantic Directed Translation of Context 
Free Languages" at the annual meeting of the Association for 
Computational Linguistics, at the University of Michigan, 
August 2, 1973. 
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H. W. Buttelmann presentad a paper entitled "Applications In Automated 
Language Processing" at the 1974 Annual Meeting of the Ohio 
Academy of Sciences at Wooster, Ohio, April 24-26, 1974. 

J. Chandler presented the paper "Performance Evaluation of Information 
Systems: Simulation Time Facilities and Statistics" at the 
Computer Science Conference, Detroit, Michigan, Fe%ruary 11,-14, 1974. 

B. Chandrasekaran was a member of the panel on "Pattern Recognldon: 
Bridging the Gap Between Theory and Implementation" as part of 
the International Joint Conference on Pattern Recognition, 
Washington, D. C, October 30, 1973. 

B. Chandrasekaran was the organizer and chairman of the Norbert 

Wiener Commemorative Symposium, Boston, Massachusetts, November 7, 
1973. 

B. Chandrasekaran was Chairman of the session on Artificial Intelligence 
I, at the Computer Science Conference, Detroit, Michigan, 
February 11-14, 1974. 

B. Chandrasekaran presented "A Deterministic Finite Memory Algorithm 
for the Symmetric Hypothesis Testing Problem" at the IEEE 
International Information Theory Symposium, Ashkelon, Israel, 
June 28, 1974. 
• (Co-author: C.C. Lam) 

B. Chandrasekaran has been elected a Vice-President of the Systems, 
Man and Cybernetics Society of the Institute of Electrical and 
Electronics Engineers. 

B. Chandrasekaran was a reviewer for the IEEE Transactions Information 
Theory Information and Control, ComputlnR Reviews , 1973-1974. 

B. Chandrasekaran reviewed proposals for the National Science Foundation 

Office of Computing Activities and NSF Engineering Division for 
19 73-19 

T. G. DeLutls presented "Recent Developments in Data Description 

Languages" for the Central Ohio Chapter Association for Computing 
Machinery, Columbus, Ohio, October 8, 1973. 

T. G. DeLutls presented "Performance Evaluation of Information Systems: 
Tlie AUXSYM Simulator" at the Computer Science Conference, Detroit, 
Michigan, February 11-14, 1974. 

C. R. Foulk presented "Structured Programming in Unstructured Languages" 

at the Computer Science Conference, Detroit, Michigan, February 11-14, 
1974. 
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C, Foulk served on the organizing committee of the annual meeting 
of the Association for Computing Machinery » Special Interest 

Group on Computer Science Education, Detroit , Michigan, February 14-15, 
1974. 

D. Hsiao was invited to attend the IBM Data Security Sytqpslum 
and participated In the workshop on Issues related to data 
security, New York, October 29 - November 1, 1973, 

D. K. Hsiao presented the paper ''A Unified Approach to Structure, 

Access and Update In Data Base Systems" at the European Computing 
Congress held at Brunei University, London, England, May 14, 1974, 

D« K« Hsiao and nine undergraduate and gradtiate students from the 
Department of Computer and Information Science have recently 
completed the design and Implementation of the first PL/1-llke 
compiler language for the DECsystem-lO computer. The compiler 
language Is called the PL/X whose syntax Is to a large extent 
the syntax of the PL/1. 

S. James presented the paper **A Sequential Approach to Automatic 

Document Classification and Retrieval" at the Computer Science 
Conference, Detroit, Michigan, February 11-14, 1974. 
(Co-authors: L.J, White, J. Rush, D, Westbrook, G. Jones, G. Kar) 

D. L, Kalmey presented ^'Numerical Experimentation with Routines for 
Solving Non- linear Systems of Equations'' at the Computing 
Machinery Annual Conference, Atlanta, Georgia, August 27-29, 1973. 
(Co-authors: D.S. Kerr, L.J, White) 

D. L. Kalmey presented the paper ''Routines for Solving Nonlinear 
Systems of Equations" at the Computer Science Conference, 
Detroit, Michigan, February 11-14, 1974, 
(Co-authors: D.S* Kerr, L.J, Wiilte) 

C. Kearns has been elected to serve a three-year term as circulation 

manager-treasurer of the Engineering Design Graphics Jomal, 

D. S. Kerr attended the Computer Science Conference, Detroit, Michigan, 

February 11-14, 1974. 

D, S. Kerr served as Chairman of the ^innual meeting of the Association 
for Computing Machinery, Special Interest Group on Computer 
Science Education In Detroit, Michigan, February 14-15, 1974, 

D. S. Kerr attended the ACM-SIGFIDET Workshop on Data Description, 
Access and Control, Ann Arbor, Michigan, May 1-3, 1974, 

D. S. Kerr was a psiir^llst for "Research In Data Security - Policy 
and Projects' !»J the National Computer Conference, Chicago, 
Illinois, May 6-10, 1974. 
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D, S, Kerr waa appointed Chairman of the Central Ohio Chapter of 
the Association for Cnnputlng Machinery. 

B. M, Klrsch presented the paper "Inqtroved Error Diagnostics for 

IBM System/ 370 OS/MVT Assenbler Program Duiq>s" at the Computer 
Science Conference, Detroit Michigan, February 11-14, 1974. 

K. S. Koch presented the paper "Data Base Reorganization Tools" at 

the Coiqputer Science Conference, Detroit, Michigan, February 11-14, 
1974. 

U. S. Koch was accepted as a member of the Subschema Task Group of 
the CODASYL Data Definition Language Committee. 

M. J, Lee presented "Design Specifications for an Information Retrieval 
System Management Game" at the Association for Computing Machinery 
Annual Conference, Atlanta, Georgia, August 27-29, 1973. 
(Co-author: G.J. Lazorick) 

0. Louge presented the paper "An Index of Smoothness of Program 

Flowgraphs" at the Computer Science Conference, Detroit, Michigan, 
February 11-14, 1974. 

R. F. Mathls chaired one of the Research Abstracts Sessions at the 
Association for Computing Machinery Annual Conference, Atlanta, 
Georgia, August 27-29, 1973. 

R. F. Mathls attended the organizational meeting of Local Users 

Group of DECUSO, Pittsburgh, Peansylvanla, September 11, 1973. 

R. F. Mathls was the chairman of the ASEE/OSU Crltchfleld Effective 

Teaching Institute, Hocking Hills State Park, Ohio, September 18-20, 
1973. 

R. F. Mathls was appointed Secretary of the Computer Science Conference 
Board at the Computer Science Conference, Detroit, Michigan, 
February 11-14, 1974. 

R. F. Mathls presented the paper "Program Debugging in a General 

Program Development Model" at the Computer Science Conference, 
Detroit, Michigan, February 11-14, 1974. 

R. F. Mathls presented "Teaching Debugging" at the Fourth SIGCSE 
Technical Symposium, Detroit, Michigan, February 14-15, 1974. 

R. F. Mathls was the leader of the ASEE Compus Activities Coordinators' 
Workshop, Mansfield, Ohio, March 1-2, 1974. 

R. F. Mathls attended the ASEE North Central Section Meeting, Ann 
Arbor, Michigan, April 19-20, 1974. 
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R. F. Mathis was the session chairman for ^^Research in Data Security - 
Policy and Projects" at the National Computer Conference, Chicago, 
Illinois, May 6-10, 197A. 

R. F. Mathis, R. LaRue presented "Computers In the Engineering 

Classroom" as part of the ASEE Box- lunch Forum, The Ohio State 
University, Columbus, Ohio, May 6, 1974. 

R. F. Mathis attended the CIC Graduate Deans meeting, Ann Arbor, 
Michigan, May 12-14, 1974. 

R. F. Mathis presented "Using the Computer to Teach Design and 

Testing" at the ASEE Annual Conference, Rensselaer Polytechnic 
Institute, Troy, New York, June 17-20, 1974. 

R, F. Mathis attended the Computer Science Department Chairmen's 
meeting. Salt Lake City, Utah, June 30 - July 3, 1974. 

R. F. Mathis was nominated to honorary membership In TEXNOKOI, an 
Honorary Activities Fraternity, of the College of Engineering, 
The Ohio State University. 

A. E. Petrarca presented the paper "Algorithmic Control of Index 
Term Specificity in Double-KWIC Coordinate Indexing Systems" 
at the Computer Science Conference, Detroit^ Michigan, February 11-14, 
1974. 

F. Petry presented "Program Synthesis from Memory Snapshots of 

Sample Computations" at the Computer Science Conference, Detroit, 
Michigan, February 11-14, 1974. 
(Co-author: A.W. Biermann) 

F. Petry gave a seminar talk entitled "Inference of Programs from 
Memory Traces" and showed the movie Autoprogrammer at Wayne 
State University, March 1974; and at the University of South- 
western Louisiana, April 8, 1974. 

A. Pyster presented the paper "The Role of the Nonterminal In Language 
Generation" at the Computer Science Conferency, Fatrolt, Michign, 
February 11-14, 1974. 
Co-authors: H.W. Buttelmann, L. Reeker) 

J. Roths tein presented "A Cellular Automaton for Pattern Recognition 
by Parallel Computation" (Co-author: C. Welman) and "A New 
Binary Code for Pattern Recognition by Parallel Computation" 
at the Sagamore Computer Conference on Parallel Processing, 
August 22-24, 1973 and at the Association for Computing Machinery 
Annual Conference, Atlanta, Georgia, August 29, 1973. 

J. Rothstein presented the paper "The Roths teln Straight Line Code 
as a Formal Language" at the Computer Science Conference, 
Detroit, Michigan, February 11-14, 1974. 
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J. Roths teln presented "Fission Products as an Energy Resource" 

before the American Physical Society, Philadelphia, Pennsylvania, 
March 25-28, 1974. 

F. A. Stahl presented "Semantics Modeling for Deductive Question- 
Answering" at the Third International Joint Conference on 
Artificial Intelligence, Stanford University, August 1973. 

F. A. Stahl attended the NBS conference on Privacy and Security in 
Computer Systems, November 19-20, 1973. 

F. A. Stahl presented a talk entitled "Conservation, Computers, and 
You" at the Computer Science Conference, Wayne State University, 
February 19 74. 

F. A. Stahl was awarded an Ohio State University Research Grant to 
investigate cryptographic techniques for computer security and 
privacy, June 1974. 

S. M. Strong had her paper "An Algorithm for Generating Structural 
Surrogates of English Text" chosen as the winner of the 1973 
American Society for Information Science (ASIS) Student Member 
Paper Contest. 

C. I. Taylor became a member of the Board of Trustees of the Columbus 
Regional Information Service (ORIS) serving the Columbus Area 
Chamber of Commerce and the Columbus area citizens, March 15, 1974. 

C. I. Taylor served as a consultant to the Directive Teaching Instructional 
Materials System (DTIMS) project on the development of a computerized 
data base, the Department of Exceptional Children, The Ohio State 
University, May - June 1974. 

C. I. Taylor attended the Planning Conference for NCEMMH/ALRC National 
Information Retrieval System, The Ohio State University, June 6-7, 
1974. 

C. I. Taylor prepared the work statement comprising the technical 

content of a Request for Proposal (RFP) regarding an information 
retrieval system for a national audience, serving the handicapped 
and requiring access to special education materials, June 1974. 

W. M. Wagner presented the paper "Asympototes of a Differential 

Equation of Ion Recombination" at the Association for Computing 
Machinery Annual Conference, Atlanta, Georgia, August 27-29, 1974. 
(Co-author: H.R. Wood) 

W. M. Wagner presented the paper "Asymptotes of a Differential 

Equation of Ion Recombination of an Arbitrary Number of Different 
Molecules" at the Computer Science Conference, Detroit, Michigan, 
Febniary 11-14, 1974. 
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R. Welman presented the paper ^^A Cellular Automaton for Pattern 
Recognition by Parallel Computation** at the Association for 
Computing Machinery Annual Conferencei Atlanta, Georgiai August 27-29| 
1973^ 

M« C, Yovits organized the Research Abstracts Sessions and chaired 
one of the sessions at the Association for Computing Machinery 
Annual Conference , Atlanta, Georgia » August 27-299 1973, 

C, Yovits presented a paper on ''A Quantitative Measure for the 
Information Contained in a Data Set or a Message** at the 
Association for Co]q>uting Machinery Annual Conference , Atlanta, 
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